This tape contains three files nciopen.mol.Z connection tables for 126705 compounds in MDL's SDFile format ( compressed with the unix compress command ) nciopen.example an annotated example of a connection table nciopen.readme this text The structures in this file are of the compounds in the NCI database on July 27, 1993 that met three conditions: 1) a complete connection table existed 2) the compound was not covered by a proprietary agreement 3) a CAS registry number was given for the compound The compressed file takes up about 11 Meg, the uncompressed file about 180 Meg. What information is in? 1) The atomic symbol for each non-hydrogen node and the formal charge ( if any ) 2) the bonds and bond order ( single, double, triple, aromatic ) 3) two data fields: the NSC number ( NCI's internal ID ). field name NSC the CAS registry number. field name CAS_RN what information is NOT in? 1) no sterochemistry of any kind ( atom centered or bond centered ) is given. 2) no coordinates are given; they are all just set to zero. Caveats and Disclaimers The collection and organization of this information was paid for by federal government funds. The files are NOT copyrighted and can be used by anyone for whatever purpose they choose. Although we believe the information in these files is correct, we can not guarantee either that it is correct or that it is suitable for any particular purpose. For more information On the MDL SDFile format: Dalby, A., et. al. J. Chem. Inf. Comput. Sci. 32:244-255 (1992) On a specific compound: Do a literature search using the CAS registry number. Please, we simply don't have the resources to answer requests for information on specific compounds. On other questions: Dr. Dan Zaharevitz email - zaharevitz@dtpvx2.ncifcrf.gov phone - (301)496-8747 FAX - (301)496-8333