25.12.2014 Views

Best practices for chemical data curation and QSAR model ...

Best practices for chemical data curation and QSAR model ...

Best practices for chemical data curation and QSAR model ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

But … the unbearable lightness of <strong>model</strong> building<br />

<strong>for</strong> training sets…<br />

3<br />

2.5<br />

Predicted LogED50<br />

2<br />

1.5<br />

1<br />

Training<br />

Linear (Training)<br />

0.5<br />

0<br />

0 1 2 3 4<br />

Actual LogED50 (ED50 = mM/kg)<br />

…leads to unacceptable prediction accuracy.<br />

EXTERNAL TEST SET PREDICTIONS<br />

Observed<br />

9<br />

7<br />

5<br />

y = 0.5958x + 2.3074<br />

R 2 = 0.2135<br />

Observed<br />

9<br />

7<br />

5<br />

y = 0.4694x + 2.9313<br />

R 2 = 0.1181<br />

3<br />

3 5 7 9<br />

3<br />

3 5 7 9<br />

Predicted<br />

Predicted

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!