ACD/LogP DB: What's New 


 
Request more information
   

ACD/LogP  DB


 


ACD/LogP DB

What's New

Version 5.0 to 6.0

ACD/LogP DB, v. 6.0, features improved prediction and training capabilities and offers a number of new exciting options:

  • Expanded internal database now contains 18,412 compounds due to the addition of the BioByte Star List.
  • Enhanced accuracy of prediction as a result of an improvement of the logP calculation algorithm for several classes of compounds and an addition of numerous new fragmental and interaction increments. The logP training dataset was substantially increased due to the addition of BioByte data.
  • Similarity Search: in addition to the existing structure and substructure searches, you can now search the database to discover compounds with similar chemical structures.
  • New property - Polar Surface Area (PSA) is added to the LogP prediction software
  • Enhanced user training: logP prediction engine can now be simultaneously trained with multiple databases to improve the accuracy of calculations for user's datasets and to reduce training time. You can also transfer your logP user database to the logP user fragments file to shorten training time even further.
  • Enhanced integration with ACD/ChromManager: experimental chromatographic data can be transferred into an ACD/LogP DB training database, transforming retention times into logP/logD data.
  • ACD/LogP DB databases can now be viewed in table format.
  • SDfile Import/Export options can be customized.
Improved Algorithm

As part of an ongoing effort to improve the scope and predictive accuracy of ACD/LogP DB, in early 2001 Advanced Chemistry Development, Inc. entered into a strategic partnership with BioByte Corp. Under the terms of the agreement, we were able to use BioByte's Star List database of measured logP values to improve the accuracy and scope our logP prediction.

Additionally, the content of the Star List was added to the Internal Database of LogP DB, with primary literature citation and credit to BioByte.

As a result of this collaboration, we were able to review and add thousands of logP values to the ACD/LogP DB training set. The algorithm for version 6.0 of ACD/LogP DB now uses more information about different chemical classes, additive groups and aromatic/aliphatic interactions:

       Ver 1.0-4.0    Ver. 4.5    Ver. 5.0    Ver. 6.0
Additive groups    531    830    859    1240
Aliphatic Interactions    652    873    875    1506
Aromatic Interactions    996    1019    1589    2433
Total    2179    2722    3323    5179

We were able to successfully predict the logP values for the compounds from the BioByte Star List using the earlier versions of ACD/LogP DB. However, the addition of new groups and interactions substantially improved the quality of prediction for version 6.0, as shown below. As reflected by the statistical data, the standard error of logP prediction decreased from 0.351 for version 5.0 to 0.217 for version 6.0.

ACD/LogP version 5.0ACD/LogP version 6.0
SE -Standard Error

Equation:  logPexp = a · logPcalc + b

version a b N r2 SE
5.0 0.872 0.281 10,862 0.880 0.351
6.0 0.961 0.095 10,897 0.955 0.217

Error distribution of the logP prediction also substantially improved, as demonstrated below using the same Star List testing set of 10,897 compounds. Over 75% of the predictions were within 0.3 units from the experimental value for predictions made by ACD/LogP DB version 6.0, as compared to 63.8% for version 5.0.

Difference in LOG units Number of compounds Percent of calculated Number of compounds Percent of calculated
Version 5.0 Version 6.0
All database 10,897     10,897    
Calculated 10,862     10,897    
< 14.0 10,862 100.00% - -
< 9.0 10,861 99.99% - -
< 6.0 10,850 99.89% - -
< 5.0 10,846 99.85% - -
< 4.0 10,842 99.82% 10,897 100.00%
< 3.0 10,832 99.72% 10,895 99.98%
< 2.0 10,718 98.67% 10,875 99.80%
< 1.5 10,534 96.98% 10,827 99.36%
< 1.0 10,002 92.08% 10,596 97.24%
< 0.7 9,263 85.28% 10,131 92.97%
< 0.5 8,342 76.80% 9,480 87.00%
< 0.3 6,826 62.84% 8,215 75.39%
< 0.2 5,638 51.91% 7,069 64.87%
< 0.1 3,928 36.16% 5,163 47.38%
Not calculated 35     0    

Version 4.5 to 5.0

Database

The database has increased in total number of entries, by 23%. It now contains 14606 structures with one or more literature references and experimentally measured values.

General Interface

The system can now be trained without opening database, the user can just specify the location of the previously created database through the Results Window:

Internal and user-defined databases can be examined now with the Multiple Record View. A right-click gives you a pop-up menu that lets you select display options for zoom, data fields displayed and font.

You can upgrade the indices of database records for more rapid substructure search.

   
Algorithm

We have recalculated dozens of old, not very reliable, increments. An improved algorithm for the prediction of LogP uses more information about additive groups and aromatic interactions.

    Ver 1.0-4.0 Ver. 4.5 Ver. 5.0
Additive groups 531 830 859
Aliphatic Interactions 652 873 875
Total 2179 2722 3323

Available Statistics for Version 1.0-4.0
Training Set, N = 3688
Standard Deviation (StD) = 0.21
R = 0.99
Equation is unavailable

Available Statistics for Version 4.5
Training Set, N = 4540
StD = 0.2345
R = 0.9893
LogPexp = 0.9753*LogPpred + 0.0586

Available Statistics for Version 5.0
Training Set, N = 5214
StD = 0.2392
R = 0.9882
LogPexp = 0.9757*LogPpred + 0.0569

TOP

This page was last updated 31 October 2007
 

  Products | Solutions | Support
Online Services | Resources
About Us | Downloads | Events
Site Map | Contact Us
 

 
Copyright © 1996 - 2008 Advanced Chemistry Development     All rights reserved