25.12.2014 Views

Best practices for chemical data curation and QSAR model ...

Best practices for chemical data curation and QSAR model ...

Best practices for chemical data curation and QSAR model ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Major components of <strong>QSAR</strong> <strong>model</strong>ing<br />

• Target properties (dependent variable)<br />

– Continuous (e.g., IC50)<br />

– Categorical unrelated (e.g., different pharmacological classes)<br />

– Categorical related (e.g., subranges described as classes)<br />

• Descriptors (or independent variables)<br />

– Continuous (allows distance based similarity)<br />

– Categorical related (allows distance based similarity)<br />

– Categorical unrelated (require special similarity metrics)<br />

• Correlation methods (with <strong>and</strong> w/o variable selection)<br />

– Linear (e.g., LR, MLR, PCR, PLS)<br />

– Non-linear (e.g., kNN, RP, ANN, SVM)<br />

• Validation <strong>and</strong> prediction<br />

<strong>QSAR</strong><br />

Pill<br />

– Internal (training set) vs. external (test set) vs. independent evaluation<br />

set<br />

• Examples of applications <strong>and</strong> pitfalls

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!