This is a text version of the MOPAC7 manual produced from the LaTeX (actually DVI) version of a manual with a program "dvi2tty". It is really useless, since it has long lines (132 characters), and all the mathematical expressions, tables, etc., are nonexistent or messed up. However, if for some reason you cannot process LaTeX manual from the mopac7-man.tar.Z or cannot print the PostScript, then here it is. Do not use is please... Jan Labanowski jkl@ccl.net ------------------------------------------------ MOPAC Manual (Seventh Edition) Dr James J. P. Stewart PUBLIC DOMAIN COPY (NOT SUITABLE FOR PRODUCTION WORK) January 1993 ___________________________________________________________________ This document is intended for use by developers of semiempirical programs and software. It is not intended for use as a guide to MOPAC. All the new functionalities which have been donated to the MOPAC project during the period 1989-1993 are included in the program. Only minimal checking has been done to ensure confor- mance with the donors' wishes. As a result, this program should not be used to judge the quality of programming of the donors. This version of MOPAC-7 is not supported, and no attempt has been made to ensure reliable performance. This program and documentation have been placed entirely in the public domain, and can be used by anyone for any purpose. To help developers, the donated code is packaged into files, each file representing one donation. In addition, some notes have been added to the Manual. These may be useful in understanding the donations. If you want to use MOPAC-7 for production work, you should get the copyrighted copy from the Quantum Chemistry Program Exchange. That copy has been carefully written, and allows the donors' contributions to be used in a full, production-quality program. ___________________________________________________________________ Contents Description of MOPAC 1 1.1 Summary of MOPAC capabilities : : : : : : : : : : : : : : 1 1.2 Copyright status of MOPAC : : : : : : : : : : : : : : : : : 2 1.3 Porting MOPAC to other machines : : : : : : : : : : : : : : : 2 1.4 Relationship of AMPAC and MOPAC : : : : : : : : : : : : : 3 1.5 Programs recommended for use with MOPAC : : : : : : : : : : : 3 1.6 The data-file : : : : : : : : : : : : : : : : : : : : : @ 1.6.1 Example of data for ethylene : : : : : : : : : : : : : : 5 1.6.2 Example of data for polytetrahydrofuran : : : : : : : : : : : 6 2 Keywords 9 2.1 Specification of keywords : : : : : : : : : : : : : : : : 9 2.2 Full list of keywords used in MOPAC : : : : : : : : : : : : : : 9 2.3 Definitions of keywords : : : : : : : : : : : : : : : : : : 12 2.4 Keywords that go together : : : : : : : : : : : : : : : : 39 3 Geometry specification 41 3.1 Internal coordinate definition : : : : : : : : : : : : : : : : 41 3.1.1 Constraints : : : : : : : : : : : : : : : : : : 41 3.2 Gaussian Z-matrices : : : : : : : : : : : : : : : : : : : 42 3.3 Cartesian coordinate definition : : : : : : : : : : : : : : : 43 3.4 Conversion between various formats : : : : : : : : : : : : : : 43 3.5 Definition of elements and isotopes : : : : : : : : : : : : : : : 44 3.6 Examples of coordinate definitions : : : : : : : : : : : : : : : 46 4 Examples 49 4.1 MNRSD1 test data file for formaldehyde : : : : : : : : : : : : 49 4.2 MOPAC output for test-data file MNRSD1 : : : : : : : : : : : : 50 5 Testdata 55 5.1 Data file for a force calculation : : : : : : : : : : : : : : : 55 5.2 Results file for the force calculation : : : : : : : : : : : : : : : 56 5.3 Example of reaction path with symmetry : : : : : : : : : : : : : 63 6 Background 65 6.1 Introduction : : : : : : : : : : : : : : : : : : : : @ 6.2 AIDER : : : : : : : : : : : : : : : : : : : : : : @ 6.3 Correction to the peptide linkage : : : : : : : : : : : : : : 66 6.4 Level of precision within MOPAC : : : : : : : : : : : : : : 67 6.5 Convergence tests in subroutine ITER : : : : : : : : : : : : : 69 6.6 Convergence in SCF calculation : : : : : : : : : : : : : : : 69 6.7 Causes of failure to achieve an SCF : : : : : : : : : : : : : : : 70 _________________________________________________________CONTENTS____________ 6.8 Torsion or dihedral angle coherency : : : : : : : : : : : : : : : 71 6.9 Vibrational analysis : : : : : : : : : : : : : : : : : : : 71 6.10 A note on thermochemistry : : : : : : : : : : : : : : : : : 71 6.10.1 Basic Physical Constants : : : : : : : : : : : : : : 71 6.10.2 Thermochemistry from ab initio MO methods : : : : : : : : : 72 6.11 Reaction coordinates : : : : : : : : : : : : : : : : : : : 76 6.12 Sparkles : : : : : : : : : : : : : : : : : : : : : : @ 6.13 Mechanism of the frame in FORCE calculation : : : : : : : : : : 86 6.14 Configuration interaction : : : : : : : : : : : : : : : : 87 6.15 Reduced masses in a force calculation : : : : : : : : : : : : : 91 6.16 Use of SADDLE calculation : : : : : : : : : : : : : : : : : 91 6.17 How to escape from a hilltop : : : : : : : : : : : : : : : : 93 6.17.1 EigenFollowing : : : : : : : : : : : : : : : : : 94 6.17.2 Franck-Condon considerations : : : : : : : : : : : : : : 96 6.18 Outer Valence Green's Function : : : : : : : : : : : : : : : 97 6.18.1 Example of OVGF calculation : : : : : : : : : : : : : 98 6.19 COSMO (Conductor-like Screening Model) : : : : : : : : : : : : 99 6.20 Solid state capability : : : : : : : : : : : : : : : : : : : 99 7 Program 101 7.1 Main geometric sequence : : : : : : : : : : : : : : : : : 101 7.2 Main electronic flow : : : : : : : : : : : : : : : : : : : 102 7.3 Control within MOPAC : : : : : : : : : : : : : : : : : 102 7.3.1 Subroutine GMETRY : : : : : : : : : : : : : : : : 103 8 Error messages produced by MOPAC 105 9 Criteria 113 9.1 SCF criterion : : : : : : : : : : : : : : : : : : : : : 113 9.2 Geometric optimization criteria : : : : : : : : : : : : : : : 113 10 Debugging 117 10.1 Debugging keywords : : : : : : : : : : : : : : : : : : : 117 11 Installing MOPAC 121 11.1 ESP calculation : : : : : : : : : : : : : : : : : : : : 124 A Names of FORTRAN-77 files 127 B Subroutine calls in MOPAC 129 C Description of subroutines 137 D Heats of formation 147 E References 149 CONTENTS___________________________________________________________ o New Functionalities: - Michael B. Coolidge, The Frank J. Seiler Research Laboratory, U.S. Air Force Academy, CO 80840, and James J. P. Stewart, Stewart Computational Chemistry, 15210 Paddington Circle, Colorado Springs, CO 80921-2512. (The Air Force code was obtained under the Freedom of Information Act) Symmetry is used to speed up FORCE calculations, and to facilitate the analysis of molecular vibrations. - David Danovich, The Fritz Haber Research Center for Molecular Dynamics, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel. Ionization potentials are corrected using Green's Function techniques. The resulting I.P.s are generally more accurate than the conventional I.P.s. The point-group of the system is identified, and molecular orbitals are characterized by irreducible representation. - Andreas Klamt Bayer AG, Q18, D-5090 Leverkusen-Beyerwerk, Germany. A new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient has been added. o Existing Functionalities: - Victor I. Danilov, Department of Quantum Biophysics, Academy of Sciences of the Ukraine, Kiev 143, Ukraine. Edited the MOPAC 7 Manual, and provided the basis for Section 6.17.2, on excited states. - Henry Kurtz and Prakashan Korambath, Department of Chemistry, Mem- phis State University, Memphis TN 38152. The Hyperpolarizability calculation, originally written by Prof Kurtz, has been im- proved so that frequency dependent non-linear optical calculations can be performed. (Prakashan Korambath, dissertation research) - Frank Jensen, Department of Chemistry, Odense Universitet, Campusvej 55, DK- 5230 Odense M, Denmark. The efficiency of Baker's EF routine has been improved. - John M. Simmie, Chemistry Department, University College, Galway, Ireland. The MOPAC Manual has been completely re-formatted in the LaTeX document prepa- ration system. Equations are now much easier to read and to understand. - Jorge A. Medrano, 5428 Falcon Ln., West Chester, OH 45069, and Roberto Bochicchio (Universidad de Buenos Aires). The BONDS function has been extended to allow free valence and other quantities to be calculated. - George Purvis III, CAChe Scientific, P.O. Box 500, Delivery Station 13-400, Beaverton, OR 97077. The STO-6G Gaussian expansion of the Slater orbitals has been expanded to Principal Quantum Number 6. These expansions are used in analytical derivative calculations. o Bug-reports/bug-fixes: - Victor I. Danilov, Department of Quantum Biophysics, Academy of Sciences of the Ukraine, Kiev 143, Ukraine. Several faults in the multi-electron configuration interaction were identified, and rec- ommendations made regarding their correction. _________________________________________________________CONTENTS____________ Chapter 1 Description of MOPAC MOPAC is a general-purpose semi-empirical molecular orbital package for the study of chemical structures and reactions. The semi-empirical Hamiltonians MNDO, MINDO/3, AM1, and PM3 are used in the electronic part of the calculation to obtain molecular orbitals, the heat of formation and its derivative with respect to molecular geometry. Using these results MOPAC calculates the vibrational spectra, thermodynamic quantities, isotopic substitution effects and force constants for molecules, radicals, ions, and polymers. For studying chemical reactions, a transition state location routine and two transition state optimizing routines are available. For users to get the most out of the program, they must understand how the program works, how to enter data, how to interpret the results, and what to do when things go wrong. While MOPAC calls upon many concepts in quantum theory and thermodynamics and uses some fairly advanced mathematics, the user need not be familiar with these specialized topics. MOPAC is written with the non-theoretician in mind. The input data are kept as simple as possible so users can give their attention to the chemistry involved and not concern themselves with quantum and thermodynamic exotica. The simplest description of how MOPAC works is that the user creates a data-file which describes a molecular system and specifies what kind of calculations and output are desired. The user then commands MOPAC to carry out the calculation using that data-file. Finally the user extracts the desired output on the system from the output files created by MOPAC. 1. This is the "sixth edition". MOPAC has undergone a steady expansion since its first release, and users of the earlier editions are recommended to familiarize themselves with the changes which are described in this manual. If any errors are found, or if MOPAC does not perform as described, please contact Dr. James J. P. Stewart, Frank J. Seiler Research Laboratory, U.S. Air Force Academy, Colorado Springs, CO 80840-6528. 2. MOPAC runs successfully on normal CDC, Data General, Gould, and DEC computers, and also on the CDC 205 and CRAY-XMP "supercomputers". The CRAY version has been partly optimized to take advantage of the CRAY architecture. Several versions exist for microcomputers such as the IBM PC-AT and XT, Zenith, etc. 1.1 Summary of MOPAC capabilities 1. MNDO, MINDO/3, AM1, and PM3 Hamiltonians. 2. Restricted Hartree-Fock (RHF) and Unrestricted Hartree-Fock (UHF) methods. 3. Extensive Configuration Interaction (a) 100 configurations (b) Singlets, Doublets, Triplets, Quartets, Quintets, and Sextets ___________________________________________________Description_of_MOPAC________________ (c) Excited states (d) Geometry optimizations, etc., on specified states 4. Single SCF calculation 5. Geometry optimization 6. Gradient minimization 7. Transition state location 8. Reaction path coordinate calculation 9. Force constant calculation 10. Normal coordinate analysis 11. Transition dipole calculation 12. Thermodynamic properties calculation 13. Localized orbitals 14. Covalent bond orders 15. Bond analysis into sigma and pi contributions 16. One dimensional polymer calculation 17. Dynamic Reaction Coordinate calculation 18. Intrinsic Reaction Coordinate calculation 1.2 Copyright status of MOPAC At the request of the Air Force Academy Law Department the following notice has been placed in MOPAC. Notice of Public Domain nature of MOPAC. "This computer program is a work of the United States Government and as such is not subject to protection by copyright (17 U.S.C. # 105.) Any person who fraudulently places a copyright notice or does any other act contrary to the provisions of 17 U.S. Code 506(c) shall be subject to the penalties provided therein. This notice shall not be altered or removed from this software and is to be on all reproductions." I recommend that a user obtain a copy by either copying it from an existing site or ordering an `official' copy from the Quantum Chemistry Program Exchange, (QCPE), Department of Chem- istry, Indiana University, Bloomington, Indiana, 47405. The cost covers handling only. Contact the Editor, Richard Counts, at (812) 855-4784 for further details. 1.3 Porting MOPAC to other machines MOPAC is written for the DIGITAL VAX computer. However, the program has been written with the idea that it will be ported to other machines. After such a port has been done, the new program should be given the version number 6.10, or, if two or more versions are generated, 6.20, 6.30, etc. To validate the new copy, QCPE has a test-suite of calculations. If all tests are passed, within the tolerances given in the tests, then the new program can be called a valid version of MOPAC 6. Insofar as is practical, the mode of submission of a MOPAC job should be preserved, e.g., 1.4_Relationship_of_AMPAC_and_MOPAC____________________________________________________ (prompt) MOPAC [...] Any changes which do not violate the FORTRAN-77 conventions, and which users believe would be generally desirable, can be sent to the author. 1.4 Relationship of AMPAC and MOPAC In 1985 MOPAC 3.0 and AMPAC 1.0 were submitted to QCPE for distribution. At that time, AMPAC differed from MOPAC in that it had the AM1 algorithm. Additionally, changes in some MNDO parameters in AMPAC made AMPAC results incompatable with MOPAC Versions 1-3. Subsequent versions of MOPAC, in addition to being more highly debugged than Version 3.0, also had the AM1 method. Such versions were compatible with AMPAC and with versions 1-3 of MOPAC. In order to avoid confusion, all versions of MOPAC after 3.0 include journal references so that the user knows unambiguously which parameter sets were used in any given job. Since 1985 AMPAC and MOPAC have evolved along different lines. In MOPAC I have endeav- oured to provide a highly robust program, one with only a few new features, but which is easily portable and which can be relied upon to give precise, if not very exciting, answers. At Austin, the functionality of AMPAC has been enhanced by the research work of Prof. Dewar's group. The new AMPAC 2.1 thus has functionalities not present in MOPAC. In publications, users should cite not only the program name but also the version number. Commercial concerns have optimized both MOPAC and AMPAC for use on supercomputers. The quality of optimization and the degree to which the parent algorithm has been preserved differs between MOPAC and AMPAC and also between some machine specific versions. Different users may prefer one program to the other, based on considerations such as speed. Some modifications of AMPAC run faster than some modifications of MOPAC, and vice versa, but if these are modified versions of MOPAC 3.0 or AMPAC 1.0, they represent the programming prowess of the companies doing the conversion, and not any intrinsic difference between the two programs. Testing of these large algorithms is difficult, and several times users have reported bugs in MOPAC or AMPAC which were introduced after they were supplied by QCPE. Cooperative Development of MOPAC MOPAC has developed, and hopefully will continue to develop, by the addition of contributed code. As a policy, any supplied code which is incorporated into MOPAC will be described in the next release of the Manual, and the author or supplier acknowledged. In the following release only journal references will be retained. The objective is to produce a good program. This is obviously not a one-person undertaking; if it was, then the product would be poor indeed. Instead, as we are in a time of rapid change in computational chemistry, a time characterized by a very free exchange of ideas and code, MOPAC has been evolving by accretion. The unstinting and generous donation of intellectual effort speaks highly of the donors. However, with the rapid commercialization of computational chemistry software in the past few years, it is unfortunate but it seems unlikely that this idyllic state will continue. 1.5 Programs recommended for use with MOPAC MOPAC is the core program of a series of programs for the theoretical study of chemical phenom- ena. This version is the sixth in an on-going development, and efforts are being made to continue its further evolution. In order to make using MOPAC easier, five other programs have also been written. Users of MOPAC are recommended to use all four programs. Efforts will be made to continue the development of these programs. ___________________________________________________Description_of_MOPAC________________ HELP HELP is a stand-alone program which mimics the VAX HELP function. It is intended for users on UNIX computers. HELP comes with the basic MOPAC 6.00, and is recommended for general use. DRAW DRAW, written by Maj. Donn Storch, USAF, and available through QCPE, is a powerful editing program specifically written to interface with MOPAC. Among the various facilities it offers are: 1. The on-line editing and analysis of a data file, starting from scratch or from an existing data file, an archive file, or from a results file. 2. The option of continuous graphical representation of the system being studied. Several types of terminals are supported, including DIGITAL, TEKTRONIX, and TERAK terminals. 3. The drawing of electron density contour maps generated by DENSITY on graphical devices. 4. The drawing of solid-state band structures generated by MOSOL. 5. The sketching of molecular vibrations, generated by a normal coordinate analysis. DENSITY DENSITY, written by Dr. James J. P. Stewart, and available through QCPE, is an electron- density plotting program. It accepts data-files directly from MOPAC, and is intended to be used for the graphical representation of electron density distribution, individual M.O.'s, and difference maps. MOHELP MOHELP, also available through QCPE, is an on-line help facility, written by Maj. Donn Storch and Dr. James J. P. Stewart, to allow non-VAX users access to the VAX HELP libraries for MOPAC, DRAW, and DENSITY. MOSOL MOSOL (Distributed by QCPE) is a full solid-state MNDO program written by Dr. James J. P. Stewart. In comparison with MOPAC, MOSOL is extremely slow. As a result, while geometry optimization, force constants, and other functions can be carried out by MOSOL, these slow calculations are best done using the solid-state facility within MOPAC. MOSOL should be used for two or three dimensional solids only, a task that MOPAC cannot perform. 1.6 The data-file This section is aimed at the complete novice _ someone who knows nothing at all about the structure of a MOPAC data-file. First of all, there are at most four possible types of data-files for MOPAC, but the simplest data-file is the most commonly used. Rather than define it, two examples are shown below. An ex- planation of the geometry definitions shown in the examples is given in the chapter "GEOMETRY SPECIFICATION". 1.6_The_data-file____________________________________________________________ 1.6.1 Example of data for ethylene Line 1 : UHF PULAY MINDO3 VECTORS DENSITY LOCAL T=300 Line 2 : EXAMPLE OF DATA FOR MOPAC Line 3 : MINDO/3 UHF CLOSED-SHELL D2D ETHYLENE Line 4a: C Line 4b: C 1.400118 1 Line 4c: H 1.098326 1 123.572063 1 Line 4d: H 1.098326 1 123.572063 1 180.000000 0 2 1 3 Line 4e: H 1.098326 1 123.572063 1 90.000000 0 1 2 3 Line 4f: H 1.098326 1 123.572063 1 270.000000 0 1 2 3 Line 5 : As can be seen, the first three lines are textual. The first line consists of keywords (here seven keywords are shown). These control the calculation. The next two lines are comments or titles. The user might want to put the name of the molecule and why it is being run on these two lines. These three lines are obligatory. If no name or comment is wanted, leave blank lines. If no keywords are specified, leave a blank line. A common error is to have a blank line before the keyword line: this error is quite tricky to find, so be careful not to have four lines before the start of the geometric data (lines 4a-4f in the example). Whatever is decided, the three lines, blank or otherwise, are obligatory. In the example given, one line of keywords and two of documentation are shown. By use of keywords, these defaults can be changed. Modifying keywords are +, &, and SETUP. These are defined in the KEYWORDS chapter. The following table illustrates the allowed combinations: Line 1 Line 2 Line 3 Line 4 Line 5 Setup used Keys Text Text Z-matrix Z-matrix not used Keys + Keys Text Text Z-matrix not used Keys + Keys + Keys Text Text not used Keys & Keys Text Z-matrix Z-matrix not used Keys & Keys & Keys Z-matrix Z-matrix not used Keys SETUP Text Text Z-matrix Z-matrix 1 or 2 lines used Keys + Keys SETUP Text Text Z-matrix 1 line used Keys & Keys SETUP Text Z-matrix Z-matrix 1 line used No other combinations are allowed. The proposed use of the SETUP option is to allow a frequently used set of keywords to be defined by a single keyword. For example, if the default criteria are not suitable, SETUP might contain: " SCFCRT=1.D-8 SHIFT=30 ITRY=600 GNORM=0.02 ANALYT " " " The order of usage of a keyword is: Line 1 > Line 2 > Line 3. Line 1 > SETUP. Line 2 > SETUP. SETUP > built in default values. The next set of lines defines the geometry. In the example, the numbers are all neatly lined up; this is not necessary, but does make it easier when looking for errors in the data. The geometry is defined in lines 4a to 4f; line 5 terminates both the geometry and the data-file. Any additional data, for example symmetry data, would follow line 5. Summarizing, then, the structure for a MOPAC data-file is: ___________________________________________________Description_of_MOPAC________________ Line 1 Keywords. (See chapter 2 on definitions of keywords) Line 2 Title of the calculation, e.g. the name of the molecule or ion. Line 3 Other information describing the calculation. Lines 4 Internal or cartesian coordinates (See chapter on specification of geometry) Line 5 Blank line to terminate the geometry definition. Other layouts for data-files involve additions to the simple layout. These additions occur at the end of the data-file, after line 5. The three most common additions are: o Symmetry data: This follows the geometric data, and is ended by a blank line. o Reaction path: After all geometry and symmetry data (if any) are read in, points on the reaction coordinate are defined. o Saddle data: A complete second geometry is input. The second geometry follows the first geometry and symmetry data (if any). 1.6.2 Example of data for polytetrahydrofuran The following example illustrates the data file for a four hour polytetrahydrofuran calculation. As you can see the layout of the data is almost the same as that for a molecule, the main difference is in the presence of the translation vector atom "Tv". Line 1 :T=4H Line 2 : POLY-TETRAHYDROFURAN (C4 H8 O)2 Line 3 : Line 4a: C 0.000000 0 0.000000 0 0.000000 0 0 0 0 Line 4b: C 1.551261 1 0.000000 0 0.000000 0 1 0 0 Line 4c: O 1.401861 1 108.919034 1 0.000000 0 2 1 0 Line 4d: C 1.401958 1 119.302489 1 -179.392581 1 3 2 1 Line 4e: C 1.551074 1 108.956238 1 179.014664 1 4 3 2 Line 4f: C 1.541928 1 113.074843 1 179.724877 1 5 4 3 Line 4g: C 1.551502 1 113.039652 1 179.525806 1 6 5 4 Line 4h: O 1.402677 1 108.663575 1 179.855864 1 7 6 5 Line 4i: C 1.402671 1 119.250433 1 -179.637345 1 8 7 6 Line 4j: C 1.552020 1 108.665746 1 -179.161900 1 9 8 7 Line 4k: XX 1.552507 1 112.659354 1 -178.914985 1 10 9 8 Line 4l: XX 1.547723 1 113.375266 1 -179.924995 1 11 10 9 Line 4m: H 1.114250 1 89.824605 1 126.911018 1 1 3 2 Line 4n: H 1.114708 1 89.909148 1 -126.650667 1 1 3 2 Line 4o: H 1.123297 1 93.602831 1 127.182594 1 2 4 3 Line 4p: H 1.123640 1 93.853406 1 -126.320187 1 2 4 3 Line 4q: H 1.123549 1 90.682924 1 126.763659 1 4 6 5 Line 4r: H 1.123417 1 90.679889 1 -127.033695 1 4 6 5 Line 4s: H 1.114352 1 90.239157 1 126.447043 1 5 7 6 Line 4t: H 1.114462 1 89.842852 1 -127.140168 1 5 7 6 Line 4u: H 1.114340 1 89.831790 1 126.653999 1 6 8 7 Line 4v: H 1.114433 1 89.753913 1 -126.926618 1 6 8 7 Line 4w: H 1.123126 1 93.644744 1 127.030541 1 7 9 8 Line 4x: H 1.123225 1 93.880969 1 -126.380511 1 7 9 8 Line 4y: H 1.123328 1 90.261019 1 127.815464 1 9 11 10 Line 4z: H 1.123227 1 91.051403 1 -125.914234 1 9 11 10 Line 4A: H 1.113970 1 90.374545 1 126.799259 1 10 12 11 1.6_The_data-file____________________________________________________________ Line 4B: H 1.114347 1 90.255788 1 -126.709810 1 10 12 11 Line 4C: Tv 12.299490 1 0.000000 0 0.000000 0 1 11 10 Line 5 : 0 0.000000 0 0.000000 0 0.000000 0 0 0 0 Polytetrahydrofuran has a repeat unit of (C4 H8 O)2 ; i.e., twice the monomer unit. This is necessary in order to allow the lattice to repeat after a translation through 12:3 A. See the section on Solid State Capability for further details. Note the two dummy atoms on lines 4k and 4l. These are useful, but not essential, for defining the geometry. The atoms on lines 4y to 4B use these dummy atoms, as does the translation vector on line 4C. The translation vector has only the length marked for optimization. The reason for this is also explained in the Background chapter. ___________________________________________________Description_of_MOPAC________________ Chapter 2 Keywords 2.1 Specification of keywords All control data are entered in the form of keywords, which form the first line of a data-file. A description of what each keyword does is given in Section 2.3. The order in which keywords appear is not important although they must be separated by a space. Some keywords can be abbreviated, allowed abbreviations are noted in Section 2.3 (for example 1ELECTRON can be entered as 1ELECT). However the full keyword is preferred in order to more clearly document the calculation and to obviate the possibility that an abbreviated keyword might not be recognized. If there is insufficient space in the first line for all the keywords needed, then consider abbreviating the longer words. One type of keyword, those with an equal sign, such as, BAR=0.05, may not be abbreviated, and the full word needs to be supplied. Most keywords which involve an equal sign, such as SCFCRT=1.D-12 can, at the user's discretion, be written with spaces before and after the equal sign. Thus all permutations of SCFCRT=1.D-12, such as SCFCRT =1.D-12, SCFCRT = 1.D-12, SCFCRT= 1.D-12, SCFCRT = 1.D-12, etc. are al- lowed. Exceptions to this are T=, T-PRIORITY=, H-PRIORITY=, X-PRIORITY=, IRC=, DRC= and TRANS=. ` T=' cannot be abbreviated to ` T ' as many keywords start or end with a `T'; for the other keywords the associated abbreviated keywords have specific meanings. If two keywords which are incompatible, like UHF and C.I., are supplied, or a keyword which is incompatible with the species supplied, for instance TRIPLET and a methyl radical, then error trapping will normally occur, and an error message will be printed. This usually takes an insignificant time, so data are quickly checked for obvious errors. 2.2 Full list of keywords used in MOPAC & - TURN NEXT LINE INTO KEYWORDS + - ADD ANOTHER LINE OF KEYWORDS 0SCF - READ IN DATA, THEN STOP 1ELECTRON- PRINT FINAL ONE-ELECTRON MATRIX 1SCF - DO ONE SCF AND THEN STOP AIDER - READ IN AB INITIO DERIVATIVES AIGIN - GEOMETRY MUST BE IN GAUSSIAN FORMAT AIGOUT - IN ARC FILE, INCLUDE AB-INITIO GEOMETRY ANALYT - USE ANALYTICAL DERIVATIVES OF ENERGY WRT GEOMETRY AM1 - USE THE AM1 HAMILTONIAN BAR=n.n - REDUCE BAR LENGTH BY A MAXIMUM OF n.n BIRADICAL- SYSTEM HAS TWO UNPAIRED ELECTRONS BONDS - PRINT FINAL BOND-ORDER MATRIX C.I. - A MULTI-ELECTRON CONFIGURATION INTERACTION SPECIFIED ______________________________________________________________Keywords_______ CHARGE=n - CHARGE ON SYSTEM = n (e.g. NH4 => CHARGE=1) COMPFG - PRINT HEAT OF FORMATION CALCULATED IN COMPFG CONNOLLY - USE CONNOLLY SURFACE DEBUG - DEBUG OPTION TURNED ON DENOUT - DENSITY MATRIX OUTPUT (CHANNEL 10) DENSITY - PRINT FINAL DENSITY MATRIX DEP - GENERATE FORTRAN CODE FOR PARAMETERS FOR NEW ELEMENTS DEPVAR=n - TRANSLATION VECTOR IS A MULTIPLE OF BOND-LENGTH DERIV - PRINT PART OF WORKING IN DERIV DFORCE - FORCE CALCULATION SPECIFIED, ALSO PRINT FORCE MATRIX. DFP - USE DAVIDON-FLETCHER-POWELL METHOD TO OPTIMIZE GEOMETRIES DIPOLE - FIT THE ESP TO THE CALCULATED DIPOLE DIPX - X COMPONENT OF DIPOLE TO BE FITTED DIPY - Y COMPONENT OF DIPOLE TO BE FITTED DIPZ - Z COMPONENT OF DIPOLE TO BE FITTED DMAX - MAXIMUM STEPSIZE IN EIGENVECTOR FOLLOWING DOUBLET - DOUBLET STATE REQUIRED DRC - DYNAMIC REACTION COORDINATE CALCULATION DUMP=n - WRITE RESTART FILES EVERY n SECONDS ECHO - DATA ARE ECHOED BACK BEFORE CALCULATION STARTS EF - USE EF ROUTINE FOR MINIMUM SEARCH EIGINV - EIGS - PRINT ALL EIGENVALUES IN ITER ENPART - PARTITION ENERGY INTO COMPONENTS ESP - ELECTROSTATIC POTENTIAL CALCULATION ESPRST - RESTART OF ELECTROSTATIC POTENTIAL ESR - CALCULATE RHF UNPAIRED SPIN DENSITY EXCITED - OPTIMIZE FIRST EXCITED SINGLET STATE EXTERNAL - READ PARAMETERS OFF DISK FILL=n - IN RHF OPEN AND CLOSED SHELL, FORCE M.O. n TO BE FILLED FLEPO - PRINT DETAILS OF GEOMETRY OPTIMIZATION FMAT - PRINT DETAILS OF WORKING IN FMAT FOCK - PRINT LAST FOCK MATRIX FORCE - FORCE CALCULATION SPECIFIED GEO-OK - OVERRIDE INTERATOMIC DISTANCE CHECK GNORM=n.n- EXIT WHEN GRADIENT NORM DROPS BELOW n.n GRADIENTS- PRINT ALL GRADIENTS GRAPH - GENERATE FILE FOR GRAPHICS HCORE - PRINT DETAILS OF WORKING IN HCORE HESS=N - OPTIONS FOR CALCULATING HESSIAN MATRICES IN EF H-PRIO - HEAT OF FORMATION TAKES PRIORITY IN DRC HYPERFINE- HYPERFINE COUPLING CONSTANTS TO BE CALCULATED IRC - INTRINSIC REACTION COORDINATE CALCULATION ISOTOPE - FORCE MATRIX WRITTEN TO DISK (CHANNEL 9 ) ITER - PRINT DETAILS OF WORKING IN ITER ITRY=N - SET LIMIT OF NUMBER OF SCF ITERATIONS TO N. IUPD - MODE OF HESSIAN UPDATE IN EIGENVECTOR FOLLOWING K=(N,N) - BRILLOUIN ZONE STRUCTURE TO BE CALCULATED KINETIC - EXCESS KINETIC ENERGY ADDED TO DRC CALCULATION LINMIN - PRINT DETAILS OF LINE MINIMIZATION LARGE - PRINT EXPANDED OUTPUT LET - OVERRIDE CERTAIN SAFETY CHECKS LOCALIZE - PRINT LOCALIZED ORBITALS 2.2_Full_list_of_keywords_used_in_MOPAC________________________________________________ MAX - PRINTS MAXIMUM GRID SIZE (23*23) MECI - PRINT DETAILS OF MECI CALCULATION MICROS - USE SPECIFIC MICROSTATES IN THE C.I. MINDO/3 - USE THE MINDO/3 HAMILTONIAN MMOK - USE MOLECULAR MECHANICS CORRECTION TO CONH BONDS MODE=N - IN EF, FOLLOW HESSIAN MODE NO. N MOLDAT - PRINT DETAILS OF WORKING IN MOLDAT MS=N - IN MECI, MAGNETIC COMPONENT OF SPIN MULLIK - PRINT THE MULLIKEN POPULATION ANALYSIS NLLSQ - MINIMIZE GRADIENTS USING NLLSQ NOANCI - DO NOT USE ANALYTICAL C.I. DERIVATIVES NODIIS - DO NOT USE DIIS GEOMETRY OPTIMIZER NOINTER - DO NOT PRINT INTERATOMIC DISTANCES NOLOG - SUPPRESS LOG FILE TRAIL, WHERE POSSIBLE NOMM - DO NOT USE MOLECULAR MECHANICS CORRECTION TO CONH BONDS NONR - NOTHIEL - DO NOT USE THIEL'S FSTMIN TECHNIQUE NSURF=N - NUMBER OF SURFACES IN AN ESP CALCULATION NOXYZ - DO NOT PRINT CARTESIAN COORDINATES NSURF - NUMBER OF LAYERS USED IN ELECTROSTATIC POTENTIAL OLDENS - READ INITIAL DENSITY MATRIX OFF DISK OLDGEO - PREVIOUS GEOMETRY TO BE USED OPEN - OPEN-SHELL RHF CALCULATION REQUESTED ORIDE - PARASOK - IN AM1 CALCULATIONS SOME MNDO PARAMETERS ARE TO BE USED PI - RESOLVE DENSITY MATRIX INTO SIGMA AND PI BONDS PL - MONITOR CONVERGENCE OF DENSITY MATRIX IN ITER PM3 - USE THE MNDO-PM3 HAMILTONIAN POINT=N - NUMBER OF POINTS IN REACTION PATH POINT1=N - NUMBER OF POINTS IN FIRST DIRECTION IN GRID CALCULATION POINT2=N - NUMBER OF POINTS IN SECOND DIRECTION IN GRID CALCULATION POLAR - CALCULATE FIRST, SECOND AND THIRD ORDER POLARIZABILITIES POTWRT - IN ESP, WRITE OUT ELECTROSTATIC POTENTIAL TO UNIT 21 POWSQ - PRINT DETAILS OF WORKING IN POWSQ PRECISE - CRITERIA TO BE INCREASED BY 100 TIMES PULAY - USE PULAY'S CONVERGER TO OBTAIN A SCF QUARTET - QUARTET STATE REQUIRED QUINTET - QUINTET STATE REQUIRED RECALC=N - IN EF, RECALCULATE HESSIAN EVERY N STEPS RESTART - CALCULATION RESTARTED ROOT=n - ROOT n TO BE OPTIMIZED IN A C.I. CALCULATION ROT=n - THE SYMMETRY NUMBER OF THE SYSTEM IS n. SADDLE - OPTIMIZE TRANSITION STATE SCALE - SCALING FACTOR FOR VAN DER WAALS DISTANCE IN ESP SCFCRT=n - DEFAULT SCF CRITERION REPLACED BY THE VALUE SUPPLIED SCINCR - INCREMENT BETWEEN LAYERS IN ESP SETUP - EXTRA KEYWORDS TO BE READ OF SETUP FILE SEXTET - SEXTET STATE REQUIRED SHIFT=n - A DAMPING FACTOR OF n DEFINED TO START SCF SIGMA - MINIMIZE GRADIENTS USING SIGMA SINGLET - SINGLET STATE REQUIRED SLOPE - MULTIPLIER USED TO SCALE MNDO CHARGES SPIN - PRINT FINAL UHF SPIN MATRIX STEP - STEP SIZE IN PATH ______________________________________________________________Keywords_______ STEP1=n - STEP SIZE n FOR FIRST COORDINATE IN GRID CALCULATION STEP2=n - STEP SIZE n FOR SECOND COORDINATE IN GRID CALCULATION STO-3G - DEORTHOGONALIZE ORBITALS IN STO-3G BASIS SYMAVG - AVERAGE SYMMETRY EQUIVALENT ESP CHARGES SYMMETRY - IMPOSE SYMMETRY CONDITIONS T=n - A TIME OF n SECONDS REQUESTED THERMO - PERFORM A THERMODYNAMICS CALCULATION TIMES - PRINT TIMES OF VARIOUS STAGES T-PRIO - TIME TAKES PRIORITY IN DRC TRANS - THE SYSTEM IS A TRANSITION STATE (USED IN THERMODYNAMICS CALCULATION) TRIPLET - TRIPLET STATE REQUIRED TS - USING EF ROUTINE FOR TS SEARCH UHF - UNRESTRICTED HARTREE-FOCK CALCULATION VECTORS - PRINT FINAL EIGENVECTORS VELOCITY - SUPPLY THE INITIAL VELOCITY VECTOR IN A DRC CALCULATION WILLIAMS - USE WILLIAMS SURFACE X-PRIO - GEOMETRY CHANGES TAKE PRIORITY IN DRC XYZ - DO ALL GEOMETRIC OPERATIONS IN CARTESIAN COORDINATES. 2.3 Definitions of keywords The definitions below are given with some technical expressions which are not further defined. Interested users are referred to Appendix E of this manual to locate appropriate references which will provide further clarification. There are three classes of keywords: 1. those which CONTROL substantial aspects of the calculation, i.e., those which affect the final heat of formation, 2. those which determine which OUTPUT will be calculated and printed, and 3. those which dictate the WORKING of the calculation, but which do not affect the heat of formation. The assignment to one of these classes is designated by a (C), (O) or (W), respectively, following each keyword in the list below. & (C) An `_&' means `turn the next line into keywords'. Note the space before the `&' sign. Since `&' is a keyword, it must be preceeded by a space. A `_&' on line 1 would mean that a second line of keywords should be read in. If that second line contained a `_&', then a third line of keywords would be read in. If the first line has a `_&' then the first description line is omitted, if the second line has a `_&', then both description lines are omitted. Examples: Use of one `&' VECTORS DENSITY RESTART & NLLSQ T=1H SCFCRT=1.D-8 DUMP=30M ITRY=300 PM3 FOCK OPEN(2,2) ROOT=3 SINGLET SHIFT=30 Test on a totally weird system: Use of two `&'s LARGE=-10 & DRC=4.0 T=1H SCFCRT=1.D-8 DUMP=30M ITRY=300 SHIFT=30 PM3 OPEN(2,2) ROOT=3 SINGLET NOANCI ANALYT T-PRIORITY=0.5 & LET GEO-OK VELOCITY KINETIC=5.0 2.3_Definitions_of_keywords__________________________________________________ + (C) A `_+' sign means `read another line of keywords'. Note the space before the `+' sign. Since `+' is a keyword, it must be preceeded by a space. A `_+' on line 1 would mean that a second line of keywords should be read in. If that second line contains a `_+', then a third line of keywords will be read in. Regardless of whether a second or a third line of keywords is read in, the next two lines would be description lines. Example of `_+' option RESTART T=4D FORCE OPEN(2,2) SHIFT=20 PM3 + SCFCRT=1.D-8 DEBUG + ISOTOPE FMAT ECHO singlet ROOT=3 THERMO(300,400,1) ROT=3 Example of data set with three lines of keywords. Note: There are two lines of description, this and the previous line. 0SCF (O) The data can be read in and output, but no actual calculation is performed when this keyword is used. This is useful as a check on the input data. All obvious errors are trapped, and warning messages printed. A second use is to convert from one format to another. The input geometry is printed in various formats at the end of a 0SCF calculation. If NOINTER is absent, cartesian coordinates are printed. Unconditionally, MOPAC Z-matrix internal coordinates are printed, and if AIGOUT is present, Gaussian Z-matrix internal coordinates are printed. 0SCF should now be used in place of DDUM. 1ELECTRON (O) The final one-electron matrix is printed out. This matrix is composed of atomic orbitals; the array element between orbitals i and j on different atoms is given by: H(i; j) = 0:5 x (fii + fij ) x overlap (i; j) The matrix elements between orbitals i and j on the same atom are calculated from the electron- nuclear attraction energy, and also from the U (i) value if i = j. The one-electron matrix is unaffected by (a) the charge and (b) the electron density. It is only a function of the geometry. Abbreviation: 1ELEC. 1SCF (C) When users want to examine the results of a single SCF calculation of a geometry, 1SCF should be used. 1SCF can be used in conjunction with RESTART, in which case a single SCF calculation will be done, and the results printed. When 1SCF is used on its own (that is, RESTART is not also used) then derivatives will only be calculated if GRAD is also specified. 1SCF is helpful in a learning situation. MOPAC normally performs many SCF calculations, and in order to minimize output when following the working of the SCF calculation, 1SCF is very useful. AIDER (C) AIDER allows MOPAC to optimize an ab-initio geometry. To use it, calculate the ab-initio gradients using, e.g., Gaussian. Supply MOPAC with these gradients, after converting them into kcal/mol. The geometry resulting from a MOPAC run will be nearer to the optimized ab-initio geometry than if the geometry optimizer in Gaussian had been used. ______________________________________________________________Keywords_______ AIGIN (C) If the geometry (Z-matrix) is specified using the Gaussian-8X, then normally this will be read in without difficulty. In the event that it is mistaken for a normal MOPAC-type Z-matrix, the keyword AIGIN is provided. AIGIN will force the data-set to be read in assuming Gaussian format. This is necessary if more than one system is being studied in one run. AIGOUT (O) The ARCHIVE file contains a data-set suitable for submission to MOPAC. If, in addition to this data-set, the Z-matrix for Gaussian input is wanted, then AIGOUT (ab initio geometry output), should be used. The Z-matrix is in full Gaussian form. Symmetry, where present, will be correctly defined. Names of symbolics will be those used if the original geometry was in Gaussian format, otherwise `logical' names will be used. Logical names are of form [][] where is `r' for bond length, `a' for angle, or `d' for dihedral, is the atom number, is the atom to which is related, , if present, is the atom number to which makes an angle, and , if present, is the atom number to which makes a dihedral. ANALYT (W) By default, finite difference derivatives of energy with respect to geometry are used. If ANALYT is specified, then analytical derivatives are used instead. Since the analytical derivatives are over Gaussian functions_a STO-6G basis set is used_the overlaps are also over Gaussian functions. This will result in a very small (less than 0.1 kcal/mole) change in heat of formation. Use analytical derivatives (a) when the mantissa used is less than about 51-53 bits, or (b) when comparison with finite difference is desired. Finite difference derivatives are still used when non-variationally optimized wavefunctions are present. AM1 (C) The AM1 method is to be used. By default MNDO is run. BAR=n.nn (W) In the SADDLE calculation the distance between the two geometries is steadily reduced until the transition state is located. Sometimes, however, the user may want to alter the maximum rate at which the distance between the two geometries reduces. BAR is a ratio, normally 0.15, or 15 percent. This represents a maximum rate of reduction of the bar of 15 percent per step. Alternative values that might be considered are BAR=0.05 or BAR=0.10, although other values may be used. See also SADDLE. If CPU time is not a major consideration, use BAR=0.03. BIRADICAL (C) Note: BIRADICAL is a redundant keyword, and represents a particular configuration interaction calculation. Experienced users of MECI (q.v.) can duplicate the effect of the keyword BIRADICAL by using the MECI keywords OPEN(2,2) and SINGLET. For molecules which are believed to have biradicaloid character the option exists to optimize the lowest singlet energy state which results from the mixing of three states. These states are, in order, (1) the (micro)state arising from a one electron excitation from the HOMO to the LUMO, which is combined with the microstate resulting from the time-reversal operator acting on the parent microstate, the result being a full singlet state; (2) the state resulting from de-excitation from the formal LUMO to the HOMO; and (3) the state resulting from the single electron in the formal HOMO being excited into the LUMO. 2.3_Definitions_of_keywords__________________________________________________ Microstate 1 Microstate 2 Microstate 3 Alpha Beta Alpha Beta Alpha Beta Alpha Beta LUMO * * * * --- --- --- --- --- --- --- --- + HOMO * * * * --- --- --- --- --- --- --- --- A configuration interaction calculation is involved here. A biradical calculation done without C.I. at the RHF level would be meaningless. Either rotational invariance would be lost, as in the D2d form of ethylene, or very artificial barriers to rotations would be found, such as in a methane molecule "orbiting" a D2d ethylene. In both cases the inclusion of limited configuration interaction corrects the error. BIRADICAL should not be used if either the HOMO or LUMO is degenerate; in this case, the full manifold of HOMO x LUMO should be included in the C.I., using MECI options. The user should be aware of this situation. When the biradical calculation is performed correctly, the result is normally a net stabilization. However, if the first singlet excited state is much higher in energy than the closed-shell ground state, BIRADICAL can lead to a destabilization. Abbreviation: BIRAD. See also MECI, C.I., OPEN, SINGLET. BONDS (O) The rotationally invariant bond order between all pairs of atoms is printed. In this context a bond is defined as the sum of the squares of the density matrix elements connecting any two atoms. For ethane, ethylene, and acetylene the carbon-carbon bond orders are roughly 1.00, 2.00, and 3.00 respectively. The diagonal terms are the valencies calculated from the atomic terms only and are defined as the sum of the bonds the atom makes with other atoms. In UHF and non- variationally optimized wavefunctions the calculated valency will be incorrect, the degree of error being proportional to the non-duodempotency of the density matrix. For an RHF wavefunction the square of the density matrix is equal to twice the density matrix. The bonding contributions of all M.O.'s in the system are printed immediately before the bonds matrix. The idea of molecular orbital valency was developed by Gopinathan, Siddarth, and Ravimohan. Just as an atomic orbital has a `valency', so has a molecular orbital. This leads to the following relations: The sum of the bonding contributions of all occupied M.O.'s is the same as the sum of all valencies which, in turn is equal to two times the sum of all bonds. The sum of the bonding contributions of all M.O.'s is zero. C.I.=n (C) Normally configuration interaction is invoked if any of the keywords which imply a C.I. calculation are used, such as BIRADICAL, TRIPLET or QUARTET. Note that ROOT= does not imply a C.I. calculation: ROOT= is only used when a C.I. calculation is done. However, as these implied C.I.'s involve the minimum number of configurations practical, the user may want to define a larger than minimum C.I., in which case the keyword C.I.=n can be used. When C.I.=n is specified, the n M.O.'s which `bracket' the occupied- virtual energy levels will be used. Thus, C.I.=2 will include both the HOMO and the LUMO, while C.I.=1 (implied for odd-electron systems) will only include the HOMO (This will do nothing for a closed-shell system, and leads to Dewar's ______________________________________________________________Keywords_______ half-electron correction for odd-electron systems). Users should be aware of the rapid increase in the size of the C.I. with increasing numbers of M.O.'s being used. Numbers of microstates implied by the use of the keyword C.I.=n on its own are as follows: Keyword Even-electron systems Odd-electron systems No. of electrons, configs No. of electrons, configs Alpha Beta Alpha Beta C.I.=1 1 1 1 1 0 1 C.I.=2 1 1 4 1 0 2 C.I.=3 2 2 9 2 1 9 C.I.=4 2 2 36 2 1 24 C.I.=5 3 3 100 3 2 100 C.I.=6 3 3 400 3 2 300 C.I.=7 4 4 1225 4 3 1225 C.I.=8 (Do not use unless other keywords also used, see below) If a change of spin is defined, then larger numbers of M.O.'s can be used up to a maximum of 10. The C.I. matrix is of size 100 x 100. For calculations involving up to 100 configurations, the spin- states are exact eigenstates of the spin operators. For systems with more than 100 configurations, the 100 configurations of lowest energy are used. See also MICROS and the keywords defining spin-states. Note that for any system, use of C.I.=5 or higher normally implies the diagonalization of a 100 by 100 matrix. As a geometry optimization using a C.I. requires the derivatives to be calculated using derivatives of the C.I. matrix, geometry optimization with large C.I.'s will require more time than smaller C.I.'s. Associated keywords: MECI, ROOT=, MICROS, SINGLET, DOUBLET, etc. C.I.=(n,m) In addition to specifying the number of M.O.'s in the active space, the number of electrons can also be defined. In C.I.=(n,m), n is the number of M.O.s in the active space, and m is the number of doubly filled levels to be used. Examples: Keywords Number of M.O.s No. Electrons C.I.=2 2 2 (1) C.I.=(2,1) 2 2 (3) C.I.=(3,1) 3 2 (3) C.I.=(3,2) 3 4 (5) C.I.=(3,0) OPEN(2,3) 3 2 (N/A) C.I.=(3,1) OPEN(2,2) 3 4 (N/A) C.I.=(3,1) OPEN(1,2) 3 N/A (3) Odd electron systems given in parentheses. CHARGE=n (C) When the system being studied is an ion, the charge, n, on the ion must be supplied by CHARGE=n. For cations n can be 1, 2, 3, etc, for anions -1 or -2 or -3, etc. Examples: ION KEYWORD ION KEYWORD NH4(+) CHARGE=1 CH3COO(-) CHARGE=-1 C2H5(+) CHARGE=1 (COO)(=) CHARGE=-2 SO4(=) CHARGE=-2 PO4(3-) CHARGE=-3 HSO4(-) CHARGE=-1 H2PO4(-) CHARGE=-1 2.3_Definitions_of_keywords__________________________________________________ DCART (O) The cartesian derivatives which are calculated in DCART for variationally optimized systems are printed if the keyword DCART is present. The derivatives are in units of kcals/Angstrom, and the coordinates are displacements in x, y, and z. DEBUG (O) Certain keywords have specific output control meanings, such as FOCK, VECTORS and DEN- SITY. If they are used, only the final arrays of the relevant type are printed. If DEBUG is supplied, then all arrays are printed. This is useful in debugging ITER. DEBUG can also increase the amount of output produced when certain output keywords are used, e.g. COMPFG. DENOUT (O) The density matrix at the end of the calculation is to be output in a form suitable for input in another job. If an automatic dump due to the time being exceeded occurs during the current run then DENOUT is invoked automatically. (see RESTART) DENSITY (O) At the end of a job, when the results are being printed, the density matrix is also printed. For RHF the normal density matrix is printed. For UHF the sum of the alpha and beta density matrices is printed. If density is not requested, then the diagonal of the density matrix, i.e., the electron density on the atomic orbitals, will be printed. DEP (O) For use only with EXTERNAL=. When new parameters are published, they can be entered at run-time by using EXTERNAL=, but as this is somewhat clumsy, a permanent change can be made by use of DEP. If DEP is invoked, a complete block of FORTRAN code will be generated, and this can be inserted directly into the BLOCK DATA file. Note that the output is designed for use with PM3. By modifying the names, the output can be used with MNDO or AM1. DEPVAR=n.nn (C) In polymers the translation vector is frequently a multiple of some internal distance. For example, in polythene it is the C1-C3 distance. If a cluster unit cell of C6H12 is used, then symmetry can be used to tie together all the carbon atom coordinates and the translation vector distance. In this example DEPVAR=3.0 would be suitable. DFP (W) By default the Broyden-Fletcher-Goldfarb-Shanno method will be used to optimize geometries. The older Davidon-Fletcher-Powell method can be invoked by specifying DFP. This is intended to be used for comparison of the two methods. DIPOLE (C) Used in the ESP calculation, DIPOLE will constrain the calculated charges to reproduce the cartesian dipole moment components calculated from the density matrix and nuclear charges. ______________________________________________________________Keywords_______ DIPX (C) Similar to DIPOLE, except the fit will be for the X-component only. DIPY (C) Similar to DIPOLE, except the fit will be for the Y-component only. DIPZ (C) Similar to DIPOLE, except the fit will be for the Z-component only. DMAX=n.nn (W) In the EF routine, the maximum step-size is 0:2 (Angstroms or radians), by default. This can be changed by specifying DMAX=n.nn. Increasing DMAX can lead to faster convergence but can also make the optimization go bad very fast. Furthermore, the Hessian updating may deteriorate when using large stepsizes. Reducing the stepsize to 0:10 or 0:05 is recommended when encountering convergence problems. DOUBLET (C) When a configuration interaction calculation is done, all spin states are calculated simultaneously, either for component of spin=0 or 1/2. When only doublet states are of interest, then DOUBLET can be specified, and all other spin states, while calculated, are ignored in the choice of root to be used. Note that while almost every odd-electron system will have a doublet ground state, DOUBLET should still be specified if the desired state must be a doublet. DOUBLET has no meaning in a UHF calculation. DRC (C) A Dynamic Reaction Coordinate calculation is to be run. By default, total energy is conserved, so that as the `reaction' proceeds in time, energy is transferred between kinetic and potential forms. DRC=n.nnn (C) In a DRC calculation, the `half-life' for loss of kinetic energy is defined as n.nnn femtoseconds. If n.nnn is set to zero, infinite damping simulating a very condensed phase is obtained. This keyword cannot be written with spaces around the `=' sign. DUMP (W) Restart files are written automatically at one hour cpu time intervals to allow a long job to be restarted if the job is terminated catastrophically. To change the frequency of dump, set DUMP=nn to request a dump every nn seconds. Alternative forms, DUMP=nnM, DUMP=nnH, DUMP=nnD for a dump every nn minutes, hours, or days, respectively. DUMP only works with geometry optimization, gradient minimization, path, and FORCE calculations. It does not (yet) work with a SADDLE calculation. ECHO (O) Data are echoed back if ECHO is specified. Only useful if data are suspected to be corrupt. 2.3_Definitions_of_keywords__________________________________________________ EF (C) The Eigenvector Following routine is an alternative to the BFGS, and appears to be much faster. To invoke the Eigenvector Following routine, specify EF. EF is particularly good in the end-game, when the gradient is small. See also HESS, DMAX, EIGINV. EIGINV (W) Not recommended for normal use. Used with the EF routine. See source code for more details. ENPART (O) This is a very useful tool for analyzing the energy terms within a system. The total energy, in eV, obtained by the addition of the electronic and nuclear terms, is partitioned into mono- and bi-centric contributions, and these contributions in turn are divided into nuclear and one- and two-electron terms. ESP (C) This is the ElectroStatic Potential calculation of K. M. Merz and B. H. Besler. ESP calculates the expectation values of the electrostatic potential of a molecule on a uniform distribution of points. The resultant ESP surface is then fitted to atom centered charges that best reproduce the distribution, in a least squares sense. ESPRST (W) ESPRST restarts a stopped ESP calculation. Do not use with RESTART. ESR (O) The unpaired spin density arising from an odd-electron system can be calculated both RHF and UHF. In a UHF calculation the alpha and beta M.O.'s have different spatial forms, so unpaired spin density can naturally be present on in-plane hydrogen atoms such as in the phenoxy radical. In the RHF formalism a MECI calculation is performed. If the keywords OPEN and C.I.= are both absent then only a single state is calculated. The unpaired spin density is then calculated from the state function. In order to have unpaired spin density on the hydrogens in, for example, the phenoxy radical, several states should be mixed. EXCITED (C) The state to be calculated is the first excited open-shell singlet state. If the ground state is a singlet, then the state calculated will be S(1); if the ground state is a triplet, then S(2). This state would normally be the state resulting from a one-electron excitation from the HOMO to the LUMO. Exceptions would be if the lowest singlet state were a biradical, in which case the EXCITED state could be a closed shell. The EXCITED state will be calculated from a BIRADICAL calculation in which the second root of the C.I. matrix is selected. Note that the eigenvector of the C.I. matrix is not used in the current formalism. Abbreviation: EXCI. Note: EXCITED is a redundant keyword, and represents a particular configuration interaction calculation. Experienced users of MECI can duplicate the effect of the keyword EXCITED by using the MECI keywords OPEN(2,2), SINGLET, and ROOT=2. ______________________________________________________________Keywords_______ EXTERNAL=name (C) Normally, PM3, AM1 and MNDO parameters are taken from the BLOCK DATA files within MOPAC. When the supplied parameters are not suitable, as in an element recently parameterized, and the parameters have not yet installed in the user's copy of MOPAC, then the new parameters can be inserted at run time by use of EXTERNAL=, where is the name of the file which contains the new parameters. consists of a series of parameter definitions in the format: where the possible parameters are USS, UPP, UDD, ZS, ZP, ZD, BETAS, BETAP, BETAD, GSS, GSP, GPP, GP2, HSP, ALP, FNnm, n=1,2, or 3, and m=1 to 10, and the elements are defined by their chemical symbols, such as Si or SI. When new parameters for elements are published, they can be typed in as shown. This file is ended by a blank line, the word END or nothing, i.e., no end-of-file delimiter. An example of a parameter data file would be (put at least 2 spaces before and after parameter name): Line 1: USS Si -34.08201495 Line 2: UPP Si -28.03211675 Line 3: BETAS Si -5.01104521 Line 4: BETAP Si -2.23153969 Line 5: ZS Si 1.28184511 Line 6: ZP Si 1.84073175 Line 7: ALP Si 2.18688712 Line 8: GSS Si 9.82 Line 9: GPP Si 7.31 Line 10: GSP Si 8.36 Line 11: GP2 Si 6.54 Line 12: HSP Si 1.32 Derived parameters do no need to be entered; they will be calculated from the optimized parameters. All "constants" such as the experimental heat of atomization are already inserted for all elements. NOTE: EXTERNAL can only be used to input parameters for MNDO, AM1, or PM3. It is unlikely, however, that any more MINDO/3 parameters will be published. See also DEP to make a permanent change. FILL=n (C) The n'th M.O. in an RHF calculation is constrained to be filled. It has no effect on a UHF calculation. After the first iteration (NOTE: not after the first SCF calculation, but after the first iteration within the first SCF calculation) the n'th M.O. is stored, and, if occupied, no further action is taken at that time. If unoccupied, then the HOMO and the n'th M.O.'s are swapped around, so that the n'th M.O. is now filled. On all subsequent iterations the M.O. nearest in character to the stored M.O. is forced to be filled, and the stored M.O. replaced by that M.O. This is necessitated by the fact that in a reaction a particular M.O. may change its character considerably. A useful procedure is to run 1SCF and DENOUT first, in order to identify the M.O.'s; the complete job is then run with OLDENS and FILL=nn, so that the eigenvectors at the first iteration are fully known. As FILL is known to give difficulty at times, consider also using C.I.=n and ROOT=m. FLEPO (O) The predicted and actual changes in the geometry, the derivatives, and search direction for each geometry optimization cycle are printed. This is useful if there is any question regarding the efficiency of the geometry optimizer. 2.3_Definitions_of_keywords__________________________________________________ FMAT Details of the construction of the Hessian matrix for the force calculation are to be printed. FORCE (C) A force-calculation is to be run. The Hessian, that is the matrix (in millidynes per Angstrom) of second derivatives of the energy with respect to displacements of all pairs of atoms in x, y, and z directions, is calculated. On diagonalization this gives the force constants for the molecule. The force matrix, weighted for isotopic masses, is then used for calculating the vibrational frequencies. The system can be characterized as a ground state or a transition state by the presence of five (for a linear system) or six eigenvalues which are very small (less than about 30 reciprocal centimeters). A transition state is further characterized by one, and exactly one, negative force constant. A FORCE calculation is a prerequisite for a THERMO calculation. Before a FORCE calculation is started, a check is made to ensure that a stationary point is being used. This check involves calculating the gradient norm (GNORM) and if it is significant, the GNORM will be reduced using BFGS. All internal coordinates are optimized, and any symmetry constraints are ignored at this point. An implication of this is that if the specification of the geometry relies on any angles being exactly 180 or zero degrees, the calculation may fail. The geometric definition supplied to FORCE should not rely on angles or dihedrals assuming exact values. (The test of exact linearity is sufficiently slack that most molecules that are linear, such as acetylene and but-2-yne, should not be stopped.) See also THERMO, LET, TRANS, ISOTOPE. In a FORCE calculation, PRECISE will eliminate quartic contamination (part of the anhar- monicity). This is normally not important, therefore PRECISE should not routinely be used. In a FORCE calculation, the SCF criterion is automatically made more stringent; this is the main cause of the SCF failing in a FORCE calculation. GEO-OK (W) Normally the program will stop with a warning message if two atoms are within 0:8 Angstroms of each other, or, more rarely, the BFGS routine has difficulty optimizing the geometry. GEO-OK will over-ride the job termination sequence, and allow the calculation to proceed. In practice, most jobs that terminate due to these checks contain errors in data, so caution should be exercised if GEO-OK is used. An important exception to this warning is when the system contains, or may give rise to, a Hydrogen molecule. GEO-OK will override other geometric safety checks such as the unstable gradient in a geometry optimization preventing reliable optimization. See also the message "GRADIENTS OF OLD GEOMETRY, GNORM= nn.nnnn". GNORM=n.nn (W) The geometry optimization termination criteria in both gradient minimization and energy mini- mization can be over-ridden by specifying a gradient norm requirement. For example, GNORM=20 would allow the geometry optimization to exit as soon as the gradient norm dropped below 20.0, the default being 1.0. For high-precision work, GNORM=0.0 is recommended. Unless LET is also used, the GNORM will be set to the larger of 0.01 and the specified GNORM. Results from GNORM=0.01 are easily good enough for all high-precision work. GRADIENTS (O) In a 1SCF calculation gradients are not calculated by default: in non-variationally optimized systems this would take an excessive time. GRADIENTS allows the gradients to be calculated. ______________________________________________________________Keywords_______ Normally, gradients will not be printed if the gradient norm is less than 2.0. However, if GRA- DIENTS is present, then the gradient norm and the gradients will unconditionally be printed. Abbreviation: GRAD. GRAPH (O) Information needed to generate electron density contour maps can be written to a file by calling GRAPH. GRAPH first calls MULLIK in order to generate the inverse-square-root of the overlap matrix, which is required for the re-normalization of the eigenvectors. All data essential for the graphics package DENSITY are then output. HESS=n (W) When the Eigenvector Following routine is used for geometry optimization, it frequently works faster if the Hessian is constructed first. If HESS=1 is specified, the Hessian matrix will be con- structed before the geometry is optimized. There are other, less common, options, e.g. HESS=2. See comments in subroutine EF for details. H-PRIORITY (O) In a DRC calculation, results will be printed whenever the calculated heat of formation changes by 0.1 kcal/mole. Abbreviation: H-PRIO. H-PRIORITY=n.nn (O) In a DRC calculation, results will be printed whenever the calculated heat of formation changes by n.nn kcal/mole. IRC (C) An Intrinsic Reaction Coordinate calculation is to be run. All kinetic energy is shed at every point in the calculation. See Background. IRC=n (C) An Intrinsic Reaction Coordinate calculation to be run; an initial perturbation in the direction of normal coordinate n to be applied. If n is negative, then perturbation is reversed, i.e., initial motion is in the opposite direction to the normal coordinate. This keyword cannot be written with spaces around the `=' sign. ISOTOPE (O) Generation of the FORCE matrix is very time-consuming, and in isotopic substitution studies several vibrational calculations may be needed. To allow the frequencies to be calculated from the (constant) force matrix, ISOTOPE is used. When a FORCE calculation is completed, ISOTOPE will cause the force matrix to be stored, regardless of whether or not any intervening restarts have been made. To re-calculate the frequencies, etc. starting at the end of the force matrix calculation, specify RESTART. The two keywords RESTART and ISOTOPE can be used together. For example, if a normal FORCE calculation runs for a long time, the user may want to divide it up into stages and save the final force matrix. Once ISOTOPE has been used, it does not need to be used on subsequent RESTART runs. ISOTOPE can also be used with FORCE to set up a RESTART file for an IRC=n calculation. 2.3_Definitions_of_keywords__________________________________________________ ITRY=NN (W) The default maximum number of SCF iterations is 200. When this limit presents difficulty, ITRY=nn can be used to re-define it. For example, if ITRY=400 is used, the maximum number of iterations will be set to 400. ITRY should normally not be changed until all other means of obtaining a SCF have been exhausted, e.g. PULAY CAMP-KING etc. IUPD=n (W) IUPD is used only in the EF routine. IUPD should very rarely be touched. IUPD=1 can be used in minimum searches if the message "HEREDITARY POSITIVE DEFINITENESS ENDANGERED. UPDATE SKIPPED THIS CYCLE" occurs every cycle for 10-20 iterations. Never use IUPD=2 for a TS search! For more information, read the comments in subroutine EF. K=(n.nn,n) (C) Used in band-structure calculations, K=(n.nn,n) specifies the step-size in the Brillouin zone, and the number of atoms in the monomeric unit. Two band-structure calculations are supported: electronic and phonon. Both require a polymer to be used. If FORCE is used, a phonon spectrum is assumed, otherwise an electronic band structure is assumed. For both calculations, a density of states is also done. The band structure calculation is very fast, so a small step-size will not use much time. The output is designed to be fed into a graphics package, and is not `elegant'. For polyethylene, a suitable keyword would be K=(0.01,6). KINETIC=n.nnn (C) In a DRC calculation n.nnn kcals/mole of excess kinetic energy is added to the system as soon as the kinetic energy builds up to 0.2 kcal/mole. The excess energy is added to the velocity vector, without change of direction. LARGE (O) Most of the time the output invoked by keywords is sufficient. LARGE will cause less-commonly wanted, but still useful, output to be printed. 1. To save space, DRC and IRC outputs will, by default, only print the line with the percent sign. Other output can be obtained by use of the keyword LARGE, according to the following rules: LARGE Print all internal and cartesian coordinates and cartesian velocities. LARGE=1 Print all internal coordinates. LARGE=-1 Print all internal and cartesian coordinates and cartesian velocities. LARGE=n Print every n'th set of internal coordinates. LARGE=-n Print every n'th set of internal and cartesian coordinates and cartesian velocities. If LARGE=1 is used, the output will be the same as that of Version 5.0, when LARGE was not used. If LARGE is used, the output will be the same as that of Version 5.0, when LARGE was used. To save disk space, do not use LARGE. ______________________________________________________________Keywords_______ LINMIN (O) There are two line-minimization routines in MOPAC, an energy minimization and a gradient norm minimization. LINMIN will output details of the line minimization used in a given job. LET (W) As MOPAC evolves, the meaning of LET is changing. Now LET means essentially "I know what I'm doing, override safety checks". Currently, LET has the following meanings: 1. In a FORCE calculation, it means that the supplied geometry is to be used, even if the gradients are large. 2. In a geometry optimization, the specified GNORM is to be used, even if it is less than 0.01. 3. In a POLAR calculation, the molecule is to be orientated along its principal moments of inertia before the calculation starts. LET will prevent this step being done. LOCALIZE (O) The occupied eigenvectors are transformed into a localized set of M.O.'s by a series of 2 by 2 rotations which maximize <_4 >. The value of 1=<_4 > is a direct measure of the number of centers involved in the MO. Thus the value of 1=<_4 > is 2.0 for H2, 3.0 for a three-center bond and 1.0 for a lone pair. Higher degeneracies than allowed by point group theory are readily obtained. For example, benzene would give rise to a 6-fold degenerate C-H bond, a 6-fold degenerate C-C sigma bond and a three-fold degenerate C-C pi bond. In principle, there is no single step method to unambiguously obtain the most localized set of M.O.'s in systems where several canonical structures are possible, just as no simple method exists for finding the most stable conformer of some large compound. However, the localized bonds generated will normally be quite acceptable for routine applications. Abbreviation: LOCAL. MAX In a grid calculation, the maximum number of points (23) in each direction is to be used. The default is 11. The number of points in each direction can be set with POINTS1 and POINTS2. MECI (O) At the end of the calculation details of the Multi Electron Configuration Interaction calculation are printed if MECI is specified. The state vectors can be printed by specifying VECTORS. The MECI calculation is either invoked automatically, or explicitly invoked by the use of the C.I.=n keyword. MICROS=n (C) The microstates used by MECI are normally generated by use of a permutation operator. When individually defined microstates are desired, then MICROS=n can be used, where n defines the number of microstates to be read in. Format for Microstates After the geometry data plus any symmetry data are read in, data defining each microstate is read in, using format 20I1, one microstate per line. The microstate data is preceded by the word "MICROS" on a line by itself. There is at present no mechanism for using MICROS with a reaction path. 2.3_Definitions_of_keywords__________________________________________________ For a system with n M.O.'s in the C.I. (use OPEN=(n1,n) or C.I.=n to do this), the populations of the n alpha M.O.'s are defined, followed by the n beta M.O.'s. Allowed occupancies are zero and one. For n=6 the closed-shell ground state would be defined as 111000111000, meaning one electron in each of the first three alpha M.O.'s, and one electron in each of the first three beta M.O.'s. Users are warned that they are responsible for completing any spin manifolds. Thus while the state 111100110000 is a triplet state with component of spin = 1, the state 111000110100, while having a component of spin = 0 is neither a singlet nor a triplet. In order to complete the spin manifold the microstate 110100111000 must also be included. If a manifold of spin states is not complete, then the eigenstates of the spin operator will not be quantized. When and only when 100 or fewer microstates are supplied, can spin quantization be conserved. There are two other limitations on possible microstates. First, the number of electrons in every microstate should be the same. If they differ, a warning message will be printed, and the calculation continued (but the results will almost certainly be nonsense). Second, the component of spin for every microstate must be the same, except for teaching purposes. Two microstates of different components of spin will have a zero matrix element connecting them. No warning will be given as this is a reasonable operation in a teaching situation. For example, if all states arising from two electrons in two levels are to be calculated, say for teaching Russel-Saunders coupling, then the following microstates would be used: Microstate No. of alpha, beta electrons Ms State 1100 2 0 1 Triplet 1010 1 1 0 Singlet 1001 1 1 0 Mixed 0110 1 1 0 Mixed 0101 1 1 0 Singlet 0011 0 2 -1 Triplet Constraints on the space manifold are just as rigorous, but much easier to satisfy. If the energy levels are degenerate, then all components of a manifold of degenerate M.O.'s should be either included or excluded. If only some, but not all, components are used, the required degeneracy of the states will be missing. As an example, for the tetrahedral methane cation, if the user supplies the microstates corre- sponding to a component of spin = 3/2, neglecting Jahn-Teller distortion, the minimum number of states that can be supplied is 90 = (6!=(1!5!))(6!=(4!2!)). While the total number of electrons should be the same for all microstates, this number does not need to be the same as the number of electrons supplied to the C.I.; thus in the example above, a cationic state could be 110000111000. The format is defined as 20I1 so that spaces can be used for empty M.O.'s. MINDO/3 (C) The default Hamiltonian within MOPAC is MNDO, with the alternatives of AM1 and MINDO/3. To use the MINDO/3 Hamiltonian the keyword MINDO/3 should be used. Acceptable alternatives to the keyword MINDO/3 are MINDO and MINDO3. MMOK (C) If the system contains a peptide linkage, then MMOK will allow a molecular mechanics correc- tion to be applied so that the barrier to rotation is increased (to 14.00 kcal/mole in N-methyl acetamide). ______________________________________________________________Keywords_______ MODE (C) MODE is used in the EF routine. Normally the default MODE=1 is used to locate a transition state, but if this is incorrect, explicitly define the vector to be followed by using MODE=n. (MODE is not a recommended keyword). If you use the FORCE option when deciding which mode to follow, set all isotopic masses to 1.0. The normal modes from FORCE are normally mass-weighted; this can mislead. Alternatively, use LARGE with FORCE: this gives the force constants and vectors in addition to the mass-weighted normal modes. Only the mass-weighted modes can be drawn with DRAW. MS=n Useful for checking the MECI calculation and for teaching. MS=n overrides the normal choice of magnetic component of spin. Normally, if a triplet is requested, an MS of 1 will be used; this excludes all singlets. If MS=0 is also given, then singlets will also be calculated. The use of MS should not affect the values of the results at all. MULLIK (O) A full Mulliken Population analysis is to be done on the final RHF wavefunction. This involves the following steps: 1. The eigenvector matrix is divided by the square root of the overlap matrix, S. 2. The Coulson-type density matrix, P , is formed. 3. The overlap population is formed from P (i; j)S(i; j). 4. Half the off-diagonals are added onto the diagonals. NLLSQ (C) The gradient norm is to be minimized by Bartel's method. This is a Non-Linear Least Squares gradient minimization routine. Gradient minimization will locate one of three possible points: (a) A minimum in the energy surface. The gradient norm will go to zero, and the lowest five or six eigenvalues resulting from a FORCE calculation will be approximately zero. (b) A transition state. The gradient norm will vanish, as in (a), but in this case the system is characterized by one, and only one, negative force constant. (c) A local minimum in the gradient norm space. In this (normally unwanted) case the gradient norm is minimized, but does not go to zero. A FORCE calculation will not give the five or six zero eigenvalues characteristic of a stationary point. While normally undesirable, this is sometimes the only way to obtain a geometry. For instance, if a system is formed which cannot be characterized as an intermediate, and at the same time is not a transition state, but nonetheless has some chemical significance, then that state can be refined using NLLSQ. NOANCI (W) RHF open-shell derivatives are normally calculated using Liotard's analytical C.I. method. If this method is NOT to be used, specify NOANCI (NO ANalytical Configuration Interaction derivatives). NODIIS (W) In the event that the G-DIIS option is not wanted, NODIIS can be used. The G-DIIS normally accelerates the geometry optimization, but there is no guarantee that it will do so. If the heat of formation rises unexpectedly (i.e., rises during a geometry optimization while the GNORM is larger than about 0.3), then try NODIIS. 2.3_Definitions_of_keywords__________________________________________________ NOINTER (O) The interatomic distances are printed by default. If you do not want them to be printed, specify NOINTER. For big jobs this reduces the output file considerably. NOLOG (O) Normally a copy of the archive file will be directed to the LOG file, along with a synopsis of the job. If this is not wanted, it can be suppressed completely by NOLOG. NOMM (C) All four semi-empirical methods underestimate the barrier to rotation of a peptide bond. A Molecular Mechanics correction has been added which increases the barrier in N-methyl acetamide to 14 kcal/mole. If you do not want this correction, specify NOMM (NO Molecular Mechanics). NONR (W) Not recommended for normal use. Used with the EF routine. See source code for more details. NOTHIEL (W) In a normal geometry optimization using the BFGS routine, Thiel's FSTMIN technique is used. If normal line-searches are wanted, specify NOTHIEL. NOXYZ (O) The cartesian coordinates are printed by default. If you do not want them to be printed, specify NOXYZ. For big jobs this reduces the output file considerably. NSURF (C) In an ESP calculation, NSURF=n specifies the number of surface layers for the Connolly surface. OLDENS (W) A density matrix produced by an earlier run of MOPAC is to be used to start the current cal- culation. This can be used in attempts to obtain an SCF when a previous calculation ended successfully but a subsequent run failed to go SCF. OLDGEO (C) If multiple geometries are to be run, and the final geometry from one calculation is to be used to start the next calculation, OLDGEO should be specified. Example: If a MNDO, AM1, and PM3 calculation were to be done on one system, for which only a rough geometry was available, then after the MNDO calculation, the AM1 calculation could be done using the optimized MNDO geometry as the starting geometry, by specifying OLDGEO. OPEN(n1,n2) (C) The M.O. occupancy during the SCF calculation can be defined in terms of doubly occup- ied, empty, and fractionally occupied M.O.'s. The fractionally occupied M.O.'s are defined by OPEN(n1,n2), where n1 = number of electrons in the open-shell manifold, and n2 = number of open-shell M.O.'s; n1/n2 must be in the range 0 to 2. OPEN(1,1) will be assumed for odd-electron systems unless an OPEN keyword is used. Errors introduced by use of fractional occupancy are automatically corrected in a MECI calculation when OPEN(n1,n2) is used. ______________________________________________________________Keywords_______ ORIDE (W) Do not use this keyword until you have read Simons' article. ORIDE is part of the EF routine, and means "Use whatever 's are produced even if they would normally be `unacceptable'." J. Simons, P. Jorgensen, H. Taylor, J. Ozment, J. Phys. Chem. 87:2745 (1983). PARASOK (W) Use this keyword with extreme caution! The AM1 method has been parameterized for only a few elements, less than the number available to MNDO or PM3. If any elements which are not parameterized at the AM1 level are specified, the MNDO parameters, if available, will be used. The resulting mixture of methods, AM1 with MNDO, has not been studied to see how good the results are, and users are strictly on their own as far as accuracy and compatibility with other methods is concerned. In particular, while all parameter sets are referenced in the output, other programs may not cite the parameter sets used and thus compatibility with other MNDO programs is not guaranteed. PI (O) The normal density matrix is composed of atomic orbitals, that is s, px, py and pz. PI allows the user to see how each atom-atom interaction is split into oe and ss bonds. The resulting "density matrix" is composed of the following basis-functions:- s-oe, p-oe, p-ss, d-oe, d-ss, d-ffi. The on-diagonal terms give the hybridization state, so that an sp2 hybridized system would be represented as s-oe: 1.0, p-oe: 2.0, p-ss: 1.0. PM3 (C) The PM3 method is to be used. POINT=n (C) The number of points to be calculated on a reaction path is specified by POINT=n. Used only with STEP in a path calculation. POINT1=n (C) In a grid calculation, the number of points to be calculated in the first direction is given by POINT1=n. `n' should be less than 24; default: 11. POINT2=n (C) In a grid calculation, the number of points to be calculated in the second direction is given by POINT2=n. `n' should be less than 24, default: 11; POTWRT (W) In an ESP calculation, write out surface points and electrostatic potential values to UNIT 21. POLAR (C) The polarizability and first and second hyperpolarizabilities are to be calculated. At present this calculation does not work for polymers, but should work for all other systems. Two different options are implemented: the older finite field method and a new time-dependent Hartree-Fock method. 2.3_Definitions_of_keywords__________________________________________________ Time-Dependent Hartree-Fock This procedure is based on the detailed description given by M. Dupuis and S. Karna (J. Comp. Chem. 12, 487 (1991)). The program is capable of calculating: Frequency Dependent Polarizability alpha(-w;w) Second Harmonic Generation beta(-2w;w,w) Electrooptic Pockels Effect beta(-w;0,w) Optical Rectification beta(0;-w,w) Third Harmonic Generation gamma(-3w;w,w,w) DC-EFISH gamma(-2w;0,w,w) Optical Kerr Effect gamma(-w;0,0,w) Intensity Dependent Index of Refraction gamma(-w;w,-w,w) The input is given at the end of the MOPAC deck and consists of two lines of free-field input followed by a list energies. The variables on the first line are: Nfreq = How many energies will be used to calculate the desired quantities. Iwflb = Type of beta calculation to be performed. This valiable is only important if iterative beta calculations are chosen. 0 - static 1 - SHG 2 - EOPE 3 - OR Ibet = Type of beta calculation: 0 - beta(0;0) static 1 - iterative calculation with type of beta chosen by Iwflb. 1 - Noniterative calculation of SHG -2 - Noniterative calculation of EOPE -3 - Noniterative calculation of OR Igam = Type of gamma calculation: 0 - No gamma calculation 1 - THG 2 - DC-EFISH 3 - IDRI 4 - OKE The vaiables on the second line are: Atol = Cutoff tolerance for alpha calculations (1.0e-4 seems reasonable) Maxitu = Maximum number of iteractions for beta calculations Maxita = Maximum number of iterations for alpha calculations Btol = Cutoff tolerance for beta calculations Nfreq lines follow, each with an energy value in eV's at which the hyperpolarizabilites are to be calculated. POWSQ (C) Details of the working of POWSQ are printed out. This is only useful in debugging. PRECISE (W) The criteria for terminating all optimizations, electronic and geometric, are to be increased by a factor, normally, 100. This can be used where more precise results are wanted. If the results ______________________________________________________________Keywords_______ are going to be used in a FORCE calculation, where the geometry needs to be known quite precisely, then PRECISE is recommended; for small systems the extra cost in CPU time is minimal. PRECISE is not recommended for experienced users, instead GNORM=n.nn and SCFCRT=n.nn are suggested. PRECISE should only very rarely be necessary in a FORCE calculation: all it does is remove quartic contamination, which only affects the trivial modes significantly, and is very expensive in CPU time. PULAY (W) The default converger in the SCF calculation is to be replaced by Pulay's procedure as soon as the density matrix is sufficiently stable. A considerable improvement in speed can be achieved by the use of PULAY. If a large number of SCF calculations are envisaged, a sample calculation using 1SCF and PULAY should be compared with using 1SCF on its own, and if a saving in time results, then PULAY should be used in the full calculation. PULAY should be used with care in that its use will prevent the combined package of convergers (SHIFT, PULAY and the CAMP-KING convergers) from automatically being used in the event that the system fails to go SCF in (ITRY-10) iterations. The combined set of convergers very seldom fails. QUARTET (C) RHF interpretation: The desired spin-state is a quartet, i.e., the state with component of spin = 1/2 and spin = 3/2. When a configuration interaction calculation is done, all spin states of spin equal to, or greater than 1/2 are calculated simultaneously, for component of spin = 1/2. From these states the quartet states are selected when QUARTET is specified, and all other spin states, while calculated, are ignored in the choice of root to be used. If QUARTET is used on its own, then a single state, corresponding to an alpha electron in each of three M.O.'s is calculated. UHF interpretation: The system will have three more alpha electrons than beta electrons. QUINTET (C) RHF interpretation: The desired spin-state is a quintet, that is, the state with component of spin = 0 and spin = 2. When a configuration interaction calculation is done, all spin states of spin equal to, or greater than 0 are calculated simultaneously, for component of spin = 0. From these states the quintet states are selected when QUINTET is specified, and the septet states, while calculated, will be ignored in the choice of root to be used. If QUINTET is used on its own, then a single state, corresponding to an alpha electron in each of four M.O.'s is calculated. UHF interpretation: The system will have three more alpha electrons than beta electrons. RECALC=n RECALC=n calculates the Hessian every n steps in the EF optimization. For small n this is costly but is also very effective in terms of convergence. RECALC=10 and DMAX=0.10 can be useful for difficult cases. In extreme cases RECALC=1 and DMAX=0.05 will always find a stationary point, if it exists. RESTART (W) When a job has been stopped, for whatever reason, and intermediate results have been stored, then the calculation can be restarted at the point where it stopped by specifying RESTART. The most common cause of a job stopping before completion is its exceeding the time allocated. A saddle-point calculation has no restart, but the output file contains information which can easily be used to start the calculation from a point near to where it stopped. 2.3_Definitions_of_keywords__________________________________________________ It is not necessary to change the geometric data to reflect the new geometry. As a result, the geometry printed at the start of a restarted job will be that of the original data, not that of the restarted file. A convenient way to monitor a long run is to specify 1SCF and RESTART; this will give a normal output file at very little cost. Note 1: In the FORCE calculation two restarts are possible. These are (a) a restart in FLEPO if the geometry was not optimized fully before FORCE was called, and (b) the normal restart in the construction of the force matrix. If the restart is in FLEPO within FORCE then the keyword FORCE should be deleted, and the keyword RESTART used on its own. Forgetting this point is a frequent cause of failed jobs. Note 2: Two restarts also exist in the IRC calculation. If an IRC calculation stops while in the FORCE calculation, then a normal restart can be done. If the job stops while doing the IRC calculation itself then the keyword IRC=n should be changed to IRC, or it can be omitted if DRC is also specified. The absence of the string "IRC=" is used to indicate that the FORCE calculation was completed before the restart files were written. ROOT=n (C) The n'th root of a C.I. calculation is to be used in the calculation. If a keyword specifying the spin-state is also present, e.g. SINGLET or TRIPLET, then the n'th root of that state will be selected. Thus ROOT=3 and SINGLET will select the third singlet root. If ROOT=3 is used on its own, then the third root will be used, which may be a triplet, the third singlet, or the second singlet (the second root might be a triplet). In normal use, this keyword would not be used. It is retained for educational and research purposes. Unusual care should be exercised when ROOT= is specified. ROT=n (C) In the calculation of the rotational contributions to the thermodynamic quantities the symmetry number of the molecule must be supplied. The symmetry number of a point group is the number of equivalent positions attainable by pure rotations. No reflections or improper rotations are allowed. This number cannot be assumed by default, and may be affected by subtle modifications to the molecule, such as isotopic substitution. A list of the most important symmetry numbers follows: ---- TABLE OF SYMMETRY NUMBERS ---- C1 CI CS 1 D2 D2D D2H 4 C(INF)V 1 C2 C2V C2H 2 D3 D3D D3H 6 D(INF)H 2 C3 C3V C3H 3 D4 D4D D4H 8 T TD 12 C4 C4V C4H 4 D6 D6D D6H 12 OH 24 C6 C6V C6H 6 S6 3 SADDLE (C) The transition state in a simple chemical reaction is to be optimized. Extra data are required. After the first geometry, specifying the reactants, and any symmetry functions have been defined, the second geometry, specifying the products, is defined, using the same format as that of the first geometry. SADDLE often fails to work successfully. Frequently this is due to equivalent dihedral angles in the reactant and product differing by about 360 degrees rather than zero degrees. As the choice of dihedral can be difficult, users should consider running this calculation with the keyword XYZ. There is normally no ambiguity in the definition of cartesian coordinates. See also BAR=. Many of the bugs in SADDLE have been removed in this version. Use of the XYZ option is strongly recommended. ______________________________________________________________Keywords_______ SCALE (C) SCALE=n.n specifies the scaling factor for Van der Waals' radii for the initial layer of the Connolly surface in the ESP calculation. SCFCRT=n.nn (W) The default SCF criterion is to be replaced by that defined by SCFCRT=. The SCF criterion is the change in energy in kcal/mol on two successive iterations. Other minor criteria may make the requirements for an SCF slightly more stringent. The SCF criterion can be varied from about 0.001 to 1.D-25, although numbers in the range 0.0001 to 1.D-9 will suffice for most applications. An overly tight criterion can lead to failure to achieve a SCF, and consequent failure of the run. SCINCR=n.nn In an ESP calculation, SCINCR=n.nn specifies the increment between layers of the surface in the Connolly surface. (default: 0.20) SETUP (C) If, on the keyword line, the word `SETUP' is specified, then one or two lines of keywords will be read from a file with the logical name SETUP. The logical file SETUP must exist, and must contain at least one line. If the second line is defined by the first line as a keyword line, and the second line contains the word SETUP, then one line of keywords will be read from a file with the logical name SETUP. SETUP=name (C) Same as SETUP, only the logical or actual name of the SETUP file is `name'. SEXTET (C) RHF interpretation: The desired spin-state is a sextet: the state with component of spin = 1/2 and spin = 5/2. The sextet states are the highest spin states normally calculable using MOPAC in its unmodi- fied form. If SEXTET is used on its own, then a single state, corresponding to one alpha electron in each of five M.O.'s, is calculated. If several sextets are to be calculated, say the second or third, then OPEN(n1,n2) should be used. UHF interpretation: The system will have five more alpha electrons than beta electrons. SHIFT=n.nn (W) In an attempt to obtain an SCF by damping oscillations which slow down the convergence or prevent an SCF being achieved, the virtual M.O. energy levels are shifted up or down in energy by a shift technique. The principle is that if the virtual M.O.'s are changed in energy relative to the occupied set, then the polarizability of the occupied M.O.'s will change pro rata. Normally, oscillations are due to autoregenerative charge fluctuations. The SHIFT method has been re-written so that the value of SHIFT changes automatically to give a critically-damped system. This can result in a positive or negative shift of the virtual M.O. energy levels. If a non-zero SHIFT is specified, it will be used to start the SHIFT technique, rather than the default 15eV. If SHIFT=0 is specified, the SHIFT technique will not be used unless normal convergence techniques fail and the automatic "ALL CONVERGERS : : :" message is produced. 2.3_Definitions_of_keywords__________________________________________________ SIGMA (C) The McIver-Komornicki gradient norm minimization routines, POWSQ and SEARCH are to be used. These are very rapid routines, but do not work for all species. If the gradient norm is low, i.e., less than about 5 units, then SIGMA will probably work; in most cases, NLLSQ is recommended. SIGMA first calculates a quite accurate Hessian matrix, a slow step, then works out the direction of fastest decent, and searches along that direction until the gradient norm is minimized. The Hessian is then partially updated in light of the new gradients, and a fresh search direction found. Clearly, if the Hessian changes markedly as a result of the line-search, the update done will be inaccurate, and the new search direction will be faulty. SIGMA should be avoided if at all possible when non-variationally optimized calculations are being done. If the Hessian is suspected to be corrupt within SIGMA it will be automatically recalculated. This frequently speeds up the rate at which the transition state is located. If you do not want the Hessian to be reinitialized _ it is costly in CPU time _ specify LET on the keyword line. SINGLET (C) When a configuration interaction calculation is done, all spin states are calculated simultaneously, either for component of spin = 0 or 1/2. When only singlet states are of interest, then SINGLET can be specified, and all other spin states, while calculated, are ignored in the choice of root to be used. Note that while almost every even-electron system will have a singlet ground state, SINGLET should still be specified if the desired state must be a singlet. SINGLET has no meaning in a UHF calculation, but see also TRIPLET. SLOPE (C) In an ESP calculation, SLOPE=n.nn specifies the scale factor for MNDO charges. (default=1.422) SPIN (O) The spin matrix, defined as the difference between the alpha and beta density matrices, is to be printed. If the system has a closed-shell ground state, e.g. methane run UHF, the spin matrix will be null. If SPIN is not requested in a UHF calculation, then the diagonal of the spin matrix, that is the spin density on the atomic orbitals, will be printed. STEP (C) In a reaction path, if the path step is constant, STEP can be used instead of explicitly specifying each point. The number of steps is given by POINT. If the reaction coordinate is an interatomic distance, only positive STEPs are allowed. STEP1=n.nnn (C) In a grid calculation the step size in degrees or Angstroms for the first of the two parameters is given by n.nnn. By default, an 11 by 11 grid is generated. See POINT1 and POINT2 on how to adjust this number. The first point calculated is the supplied geometry, and is in the upper left hand corner. This is a change from Version 5.00, where the supplied geometry was the central point. ______________________________________________________________Keywords_______ STEP2=n.nnn (C) In a grid calculation the step size in degrees or Angstroms for the second of the two parameters is given by n.nnn. STO3G (W) In an ESP calculation STO3G means "Use the STO-3G basis set to de-orthogonalize the semiem- pirical orbitals". SYMAVG (W) Used by the ESP, SYMAVG will average charges which should have the same value by symmetry. SYMMETRY (C) Symmetry data defining related bond lengths, angles and dihedrals can be included by supplying additional data after the geometry has been entered. If there are any other data, such as values for the reaction coordinates, or a second geometry, as required by SADDLE, then it would follow the symmetry data. Symmetry data are terminated by one blank line. For non-variationally optimized systems symmetry constraints can save a lot of time because many derivatives do not need to be calculated. At the same time, there is a risk that the geometry may be wrongly specified, e.g. if methane radical cation is defined as being tetrahedral, no indication that this is faulty will be given until a FORCE calculation is run. (This system undergoes spontaneous Jahn-Teller distortion.) Usually a lower heat of formation can be obtained when SYMMETRY is specified. To see why, consider the geometry of benzene. If no assumptions are made regarding the geometry, then all the C-C bond lengths will be very slightly different, and the angles will be almost, but not quite 120 degrees. Fixing all angles at 120 degrees, dihedrals at 180 or 0 degrees, and only optimizing one C-C and one C-H bond-length will result in a 2-D optimization, and exact D6h symmetry. Any deformation from this symmetry must involve error, so by imposing symmetry some error is removed. The layout of the symmetry data is: ,... where the numerical code for is given in the table of symmetry functions below. For example, ethane, with three independent variables, can be defined as: SYMMETRY ETHANE, D3D NA NB NC C C 1.528853 1 1 H 1.105161 1 110.240079 1 2 1 H 1.105161 0 110.240079 0 120.000000 0 2 1 3 H 1.105161 0 110.240079 0 240.000000 0 2 1 3 H 1.105161 0 110.240079 0 60.000000 0 1 2 3 H 1.105161 0 110.240079 0 180.000000 0 1 2 3 H 1.105161 0 110.240079 0 300.000000 0 1 2 3 0 0.000000 0 0.000000 0 0.000000 0 0 0 0 3, 1, 4, 5, 6, 7, 8, 3, 2, 4, 5, 6, 7, 8, Here atom 3, a hydrogen, is used to define the bond lengths (symmetry relation 1) of atoms 4,5,6,7 and 8 with the atoms they are specified to bond with in the NA column of the data file; 2.3_Definitions_of_keywords__________________________________________________ similarly, its angle (symmetry relation 2) is used to define the bond-angle of atoms 4,5,6,7 and 8 with the two atoms specified in the NA and NB columns of the data file. The other angles are point-group symmetry defined as a multiple of 60 degrees. Spaces, tabs or commas can be used to separate data. Note that only three parameters are marked to be optimized. The symmetry data can be the last line of the data file unless more data follows, in which case a blank line must be inserted after the symmetry data. The full list of available symmetry relations is as follows: 1 BOND LENGTH IS SET EQUAL TO THE REFERENCE BOND LENGTH 2 BOND ANGLE IS SET EQUAL TO THE REFERENCE BOND ANGLE 3 DIHEDRAL ANGLE IS SET EQUAL TO THE REFERENCE DIHEDRAL ANGLE 4 DIHEDRAL ANGLE VARIES AS 90 DEGREES - REFERENCE DIHEDRAL 5 DIHEDRAL ANGLE VARIES AS 90 DEGREES + REFERENCE DIHEDRAL 6 DIHEDRAL ANGLE VARIES AS 120 DEGREES - REFERENCE DIHEDRAL 7 DIHEDRAL ANGLE VARIES AS 120 DEGREES + REFERENCE DIHEDRAL 8 DIHEDRAL ANGLE VARIES AS 180 DEGREES - REFERENCE DIHEDRAL 9 DIHEDRAL ANGLE VARIES AS 180 DEGREES + REFERENCE DIHEDRAL 10 DIHEDRAL ANGLE VARIES AS 240 DEGREES - REFERENCE DIHEDRAL 11 DIHEDRAL ANGLE VARIES AS 240 DEGREES + REFERENCE DIHEDRAL 12 DIHEDRAL ANGLE VARIES AS 270 DEGREES - REFERENCE DIHEDRAL 13 DIHEDRAL ANGLE VARIES AS 270 DEGREES + REFERENCE DIHEDRAL 14 DIHEDRAL ANGLE VARIES AS THE NEGATIVE OF THE REFERENCE DIHEDRAL 15 BOND LENGTH VARIES AS HALF THE REFERENCE BOND LENGTH 16 BOND ANGLE VARIES AS HALF THE REFERENCE BOND ANGLE 17 BOND ANGLE VARIES AS 180 DEGREES - REFERENCE BOND ANGLE 18 BOND LENGTH IS A MULTIPLE OF REFERENCE BOND-LENGTH Function 18 is intended for use in polymers, in which the translation vector may be a multiple of some bond-length. 1,2,3 and 14 are most commonly used. Abbreviation: SYM. SYMMETRY is not available for use with cartesian coordinates. T= (W) This is a facility to allow the program to shut down in an orderly manner on computers with execution time cpu limits. The total cpu time allowed for the current job is limited to nn.nn seconds; by default this is one hour, i.e., 3600 seconds. If the next cycle of the calculation cannot be completed without running a risk of exceeding the assigned time the calculation will write a restart file and then stop. The safety margin is 100 percent; that is, to do another cycle, enough time to do at least two full cycles must remain. Alternative specifications of the time are T=nn.nnM, this defines the time in minutes, T=nn.nnH, in hours, and T=nn.nnD, in days, for very long jobs. This keyword cannot be written with spaces around the `=' sign. THERMO (O) The thermodynamic quantities, internal energy, heat capacity, partition function, and entropy can be calculated for translation, rotation and vibrational degrees of freedom for a single temperature, or a range of temperatures. Special situations such as linear systems and transition states are accommodated. The approximations used in the THERMO calculation are invalid below 100K, ______________________________________________________________Keywords_______ and checking of the lower bound of the temperature range is done to prevent temperatures of less than 100K being used. Another limitation, for which no checking is done, is that there should be no internal rotations. If any exist, they will not be recognized as such, and the calculated quantities will be too low as a result. In order to use THERMO the keyword FORCE must also be specified, as well as the value for the symmetry number; this is given by ROT=n. If THERMO is specified on its own, then the default values of the temperature range are assumed. This starts at 200K and increases in steps of 10 degrees to 400K. Three options exist for overriding the default temperature range. These are: THERMO(nnn) (O) The thermodynamic quantities for a 200 degree range of temperatures, starting at nnnK and with an interval of 10 degrees are to be calculated. THERMO(nnn,mmm) (O) The thermodynamic quantities for the temperature range limited by a lower bound of nnn Kelvin and an upper bound of mmm Kelvin, the step size being calculated in order to give approximately 20 points, and a reasonable value for the step. The size of the step in Kelvin degrees will be 1, 2, or 5, or a power of 10 times these numbers. THERMO(nnn,mmm,lll) (O) Same as for THERMO(nnn,mmm), only now the user can explicitly define the step size. The step size cannot be less than 1K. T-PRIORITY (O) In a DRC calculation, results will be printed whenever the calculated time changes by 0.1 fem- toseconds. Abbreviation, T-PRIO. T-PRIORITY=n.nn (O) In a DRC calculation, results will be printed whenever the calculated time changes by n.nn fem- toseconds. TRANS (C) The imaginary frequency due to the reaction vector in a transition state calculation must not be included in the thermochemical calculation. The number of genuine vibrations considered can be: 3N - 5 for a linear ground state system, 3N - 6 for a non-linear ground state system, or 3N - 6 for a linear transition-state complex, 3N - 7 for a non-linear transition-state complex. This keyword must be used in conjunction with THERMO if a transition state is being calcu- lated. TRANS=n (C) The facility exists to allow the THERMO calculation to handle systems with internal rotations. TRANS=n will remove the n lowest vibrations. Note that TRANS=1 is equivalent to TRANS on its own. For xylene, for example, TRANS=2 would be suitable. This keyword cannot be written with spaces around the `=' sign. 2.3_Definitions_of_keywords__________________________________________________ TRIPLET (C) The triplet state is defined. If the system has an odd number of electrons, an error message will be printed. UHF interpretation The number of alpha electrons exceeds that of the beta electrons by 2. If TRIPLET is not specified, then the numbers of alpha and beta electrons are set equal. This does not necessarily correspond to a singlet. RHF interpretation An RHF MECI calculation is performed to calculate the triplet state. If no other C.I. keywords are used, then only one state is calculated by default. The occupancy of the M.O.'s in the SCF calculation is defined as (: : :2,1,1,0,: : :), that is, one electron is put in each of the two highest occupied M.O.'s. See keywords C.I.=n and OPEN(n1,n2). TS (C) Within the Eigenvector Following routine, the option exists to optimize a transition state. To do this, use TS. Preliminary indications are that the TS method is much faster and more reliable than either SIGMA or NLLSQ. TS appears to work well with cartesian coordinates. In the event that TS does not converge on a stationary point, try adding RECALC=5 to the keyword line. UHF (C) The unrestricted Hartree-Fock Hamiltonian is to be used. VECTORS (O) The eigenvectors are to be printed. In UHF calculations both alpha and beta eigenvectors are printed; in all cases the full set, occupied and virtual, are output. The eigenvectors are normalized to unity, that is the sum of the squares of the coefficients is exactly one. If DEBUG is specified, then ALL eigenvectors on every iteration of every SCF calculation will be printed. This is useful in a learning context, but would normally be very undesirable. VELOCITY (C) The user can supply the initial velocity vector to start a DRC calculation. Limitations have to be imposed on the geometry in order for this keyword to work. These are (a) the input geometry must be in cartesian coordinates, (b) the first three atoms must not be coaxial, (c) triatomic systems are not allowed (See geometry specification - triatomic systems are in internal coordinates, by definition.) Put the velocity vector after the geometry as three data per line, representing the x, y, and z components of velocity for each atom. The units of velocity are centimeters per second. The velocity vector will be rotated so as to suit the final cartesian coordinate orientation of the molecule. If KINETIC=n.n is also specified, the velocity vector will be scaled to equal the velocity corresponding to n.n kcal/mole. This allows the user to define the direction of the velocity vector; the magnitude is given by KINETIC=n.n. ______________________________________________________________Keywords_______ WILLIAMS (C) Within the ESP calculation, the Connolly surface is used as the default. If the surface generation procedure of Donald Williams is wanted, the keyword WILLIAMS should be used. X-PRIORITY (O) In a DRC calculation, results will be printed whenever the calculated geometry changes by 0.05 A. The geometry change is defined as the linear sum of the translation vectors of motion for all atoms in the system. Abbreviation, X-PRIO. X-PRIORITY=n.nn (O) In a DRC calculation, results will be printed whenever the calculated geometry changes by n.nn A. XYZ (W) The SADDLE calculation quite often fails due to faulty definition of the second geometry because the dihedrals give a lot of difficulty. To make this option easier to use, XYZ was developed. A calculation using XYZ runs entirely in cartesian coordinates, thus eliminating the problems associated with dihedrals. The connectivity of the two systems can be different, but the numbering must be the same. Dummy atoms can be used; these will be removed at the start of the run. A new numbering system will be generated by the program, when necessary. XYZ is also useful for removing dummy atoms from an internal coordinate file; use XYZ and 0SCF. If a large ring system is being optimized, sometimes the closure is difficult, in which case XYZ will normally work. Except for SADDLE, do not use XYZ by default: use it only when something goes wrong! In order for XYZ to be used, the supplied geometry must either be in cartesian coordinates or, if internal coordinates are used, symmetry must not be used, and all coordinates must be flagged for optimization. If dummy atoms are present, only 3N-6 coordinates need to be flagged for optimization. If at all possible, the first 3 atoms should be real. Except in SADDLE, XYZ will still work if one or more dummy atoms occur before the fourth real atom, in which case more than 3N-6 coordinates will be flagged for optimization. This could cause difficulties with the EF method, which is why dummy atoms at the start of the geometry specification should be avoided. The coordinates to be optimized depend on the internal coordinate definition of real atoms 1, 2, and 3. If the position of any of these atoms depends on dummy atoms, then the optimization flags will be different from the case where the first three atoms defined are all real. The geometry is first converted to cartesian coordinates and dummy atoms excluded. The cartesian coordinates to be optimized are: Atoms R R R R R X R X R X R R R X X X R X X X R X X X X Y Z X Y Z X Y Z X Y Z X Y Z X Y Z X Y Z X Y Z Atom 1 2 + + + + + + + + + + + + + + + + + 3 + + + + + + + + + + + + + + + + + + + + + + + 4 on + + + + + + + + + + + + + + + + + + + + + + + + Where R and X apply to real and dummy atoms in the internal coordinate Z-matrix, and atoms 1, 2, 3, and 4 are the real atoms in cartesian coordinates. A `+' means that the relevant coordinate is flagged for optimization. Note that the number of flagged coordinates varies from 3N - 6 to 3N - 3, atom 1 is never optimized. 2.4_Keywords_that_go_together________________________________________________ 2.4 Keywords that go together Normally only a subset of keywords are used in any given piece of research. Keywords which are related to each other in this way are: 1. In getting an SCF: SHIFT, PULAY, ITRY, CAMP, SCFCRT, 1SCF, PL. 2. In C.I. work: SINGLET, DOUBLET, etc., OPEN(n,m), C.I.=(n,m), LARGE, MECI, MS=n, VECTORS, ESR, ROOT=n, MICROS. 3. In excited states: UHF with (TRIPLET, QUARTET, etc.), C.I.=n, C.I.=(n,m). 4. In geometry optimization: (a) Using BFGS: GNORM=n.n, XYZ, PRECISE. (b) Using EF: GNORM=n.n, XYZ, PRECISE (c) Using NLLSQ: GNORM=n.n, XYZ, PRECISE (d) Using SIGMA: GNORM=n.n, XYZ, PRECISE 5. In Gaussian work: AIGIN, AIGOUT, AIDER. 6. In SADDLE: XYZ, BAR=n.n ______________________________________________________________Keywords_______ Chapter 3 Geometry specification FORMAT: The geometry is read in using essentially "Free-Format" of FORTRAN-77. In fact, a character input is used in order to accommodate the chemical symbols, but the numeric data can be regarded as "free-format".indexdata!free-format This means that integers and real numbers can be interspersed, numbers can be separated by one or more spaces, a tab and/or by one comma. If a number is not specified, its value is set to zero. The geometry can be defined in terms of either internal or cartesian coordinates. 3.1 Internal coordinate definition For any one atom (i) this consists of an interatomic distance in Angstroms from an already-defined atom (j), an interatomic angle in degrees between atoms i and j and an already defined k, (k and j must be different atoms), and finally a torsional angle in degrees between atoms i, j, k, and an already defined atom l (l cannot be the same as k or j). See also dihedral angle coherency. Exceptions: 1. Atom 1 has no coordinates at all: this is the origin. 2. Atom 2 must be connected to atom 1 by an interatomic distance only. 3. Atom 3 can be connected to atom 1 or 2, and must make an angle with atom 2 or 1 (thus 3-2-1 or 3-1-2); no dihedral is possible for atom 3. By default, atom 3 is connected to atom 2. 3.1.1 Constraints 1. Interatomic distances must be greater than zero. Zero Angstroms is acceptable only if the parameter is symmetry-related to another atom, and is the dependent function. 2. Angles must be in the range 0.0 to 180.0, inclusive. This constraint is for the benefit of the user only; negative angles are the result of errors in the construction of the geometry, and angles greater than 180 degrees are fruitful sources of errors in the dihedrals. 3. Dihedrals angles must be definable. If atom i makes a dihedral with atoms j, k, and l, and the three atoms j, k, and l are in a straight line, then the dihedral has no definable angle. During the calculation this constraint is checked continuously, and if atoms j, k, and l lie within 0.02 Angstroms of a straight line, the calculation will output an error message and then stop. Two exceptions to this constraint are: (a) if the angle is zero or 180 degrees, in which case the dihedral is not used. (b) if atoms j, k, and l lie in an exactly straight line (usually the result of a symmetry constraint), as in acetylene, acetonitrile, but-2-yne, etc. ____________________________________________________Geometry_specification_____________ If the exceptions are used, care must be taken to ensure that the program does not violate these constraints during any optimizations or during any calculations of derivatives - see also FORCE. Conversion to Cartesian Coordinates By definition, atom 1 is at the origin of cartesian coordinate space_be careful, however, if atom 1 is a dummy atom. Atom 2 is defined as lying on the positive X axis _ for atom 2, Y=0 and Z=0. Atom 3 is in the X-Y plane unless the angle 3-2-1 is exactly 0 or 180 degrees. Atom 4, 5, 6, etc. can lie anywhere in 3-D space. 3.2 Gaussian Z-matrices With certain limitations, geometries can now be specified within MOPAC using the Gaussian Z-matrix format. Exceptions to the full Gaussian standard 1. The option of defining an atom's position by one distance and two angles is not allowed. In other words, the N4 variable described in the Gaussian manual must either be zero or not specified. MOPAC requires the geometry of atoms to be defined in terms of, at most, one distance, one angle, and one dihedral. 2. Gaussian cartesian coordinates are not supported. 3. Chemical symbols must not be followed by an integer identifying the atom. Numbers after a symbol are used by MOPAC to indicate isotopic mass. If labels are desired, they should be enclosed in parentheses, thus "Cl(on C5)34.96885". 4. The connectivity (N1, N2, N3) must be integers. Labels are not allowed. Specification of Gaussian Z-matrices The information contained in the Gaussian Z-matrix is identical to that in a MOPAC Z-matrix. The order of presentation is different. Atom N, (real or dummy) is specified in the format: Element N1 Length