CheD

Chemical Database Management System
Databases that operate with chemical information

CheD was developed to store, view, and search chemical structures and infrared, nuclear magnetic resonance, and mass spectra.

For: chemical plants, universities, research centers.

The software is intended for engineers and scientists working in chemistry

CheD was developed to store, view, and search chemical structures and infrared, nuclear magnetic resonance, and mass spectra

The software runs on the personal computer for the storage, retrieval and processing of chemical information. The program can work both as stand-alone application and with especially written WWW server and some SQL servers such as Oracle, Interbase, MS SQL. New kinds of data fields are introduced: arrays for spectral information store, WWW and database links, and user-defined functions. CheD has an open architecture for customized types, controls, services to be added. WWW server for chemical data exposure is created as a distributive, which may be installed at Windows NT or 95 platform in some minutes

Some modifications were introduced into the previously described Centroid algorithm of diversity sorting, which uses cosine similarity metric. The modified algorithm is suitable for working with large databases on personal computers. For example, less than 9 hours are required (Pentium III, 800 MHz) for diversity sorting of the database with the size greater than million of records. The problem of selection of new compounds to the existing collection is examined to reach the maximum diversity of the collection. A new algorithm for the selection of heterocyclic compounds is described

The software was applied at Uniroyal Chemical, ChemDiv, TimTec, Chemical Block International, Harvard University; many institutes of the Russian Academy of Sciences, the Moscow University, and many research institutes.

CheD program package for chemistry

  • Basic version
  • CheD is 32-bit Windows program, used for creation and supporting of chemical databases.
  • With a CheD's tools you are capable to create your own database using a variety of field types.
  • You can arrange database for viewing and printing.
  • Any record is easy to be found with CheD's search functions.
  • CheD has many routines to arrange, sort, verify, compare, merge databases
  • All search and comparison processes are very fast with CheD.
  • CheD supports the interfacing with the SDF and JCAMP formats for Chemical and Spectral data exchange, which make CheD compatible with related programs.
  • Original Structure Editor makes drawing of structure easy and pleasant
  • CheD is capable to operate with large databases - up to 10,000,000 records.
  • Due to standard Windows style it is easy to communicate with other applications.
  • With prediction tools available your database can be learned to predict unknown data.

Optional software

NMR-spectra processing includes:

  • Element properties - list of nuclei magnetic properties
  • Spectrum calculation - predict NMR peak table spectrum using database learned
  • Check DB content - compare database spectra with predicted ones
  • Standard recalculation - recalculate spectrum to another standard
    HNMR processing and assignment - loads instrumental FID, processes FT, Phase correction, Integral calculation, peak peaking, manual assignment of peaks, print assigned spectrum.
  • Prediction 1H, 13C and 31P chemical shift
  • Databases of 4,000 13C and 4,000 31P assigned spectra

The screenshot of the Chemical and Spectral data appearance is shown here.

IR-spectra processing includes:

  1. Spectrum processing- loads instrumental spectrum, performs base line correction, spectrum normalization, spectra subtraction.
  2. Peaktable generation - calculates peak table from spectral curve
  3. Spectrum edit - edits spectrum by points
  4. Contour decomposition - decomposes spectral curve into Lorentz/Gauss contours

MASS-spectra processing includes:

  1. JCAMP(Wiley)->JCAMP-Link - execute files transformation
  2. Import Instrumental files - loads instrumental Finnigan *.pic and Vector *.txt files
  3. Intensities correction - correct spectrum intensities using calibration table
  4. Export in text exchange format - saves spectrum in simple text file
  5. Peak's isotopic structure - calculates intensities of isotopic components of mass peak
  6. Exact mass - calculate exact mass for given isotopic composition
  7. Element properties - isotopes exact mass table
  8. Composition of peak - proposes brutto-formulas for given mass
  9. Expert - analyses mass spectrum using possible fragmentation
  10. Calculator - calculates in real time masses and isotopic distribution of selected fragment in structure. Saves assigned fragments in spectrum into teaching database.
  11. Look assignment database - browser of database learned
  12. Spectrum analyzing - analyzes spectrum using database learned

Routines for synthesis planning:

  1. Material balance- to calculate amount and volume of compounds, including and obtained in a synthesis. Yields and excess are taken into consideration.
  2. Calculates brutto-formula from element composition.
  3. Units of concentrations re-calculation.
  4. Dilution of solvents

Diversity, chemical structures and combinatorial chemistry:

  1. Diversity sorting - sort a data set by diversity with very fast algorithm (full sorting of 1,000,000 compounds per 9 hours).
  2. Compound Selection - select compounds from external data set, the diversity being maximized
  3. Similarity calculations - for each compound in initial data set calculates similarities with selected data set.
  4. SDF file error analyzing - checks for possible errors in SDF file.
  5. SDF file processing - edition and search in a SDF file without its conversion to database.
  6. SDF file creation - converts *.mol files from a selected directory to a SDF file.
  7. SDF file sorting - sort records in SDF file accordingly to diversity.
  8. Heterocyclic fragments generation - generate a database with non-substituted heterocycles from a dataset. Occurrence of heterocycles in the dataset is calculated. Sorting by diversity with heterocyclic fragments.
  9. Virtual compound generation - create combinatorial data set by radical recombination and from scaffold structures.
  10. Interface with SQL servers - Oracle, MS SQL, Interbase
  11. Upload and download data for above-mentioned SQL servers.

WEB server extension

Enable to publish chemical data - structures, spectra, also as text and pictures over WWW. A client can select data with structure/substructure search and traditional text search. One may compare content of local SDF with remote database very fast - 5 minutes are required to compare 1,000,000 remote databases with 10,000 SDF chemical structures.

Here you can find the structural scheme of CheD components interaction.

Software and hardware required

CheD can run at any computer with installed Windows 95/NT/2000. But to achieve appropriate performance, computer requires:

  • Processor Pentium 133 or better.
  • 32Mb of RAM to run CheD itself. To manipulate with a database of 100,000 chemical structures, 24Mb of RAM should be added.
  • 60Mb of free disk space for installation. To store 1,000,000 structures without additional data, one needs approximately 400Mb of free disk space.

Page from site http://eng.interface.ru
Source web address http://eng.interface.ru/home.asp?artId=404