Constructing Z-Matrices

This section presents a brief overview of traditional Z-matrix descriptions of molecular systems. There are restrictions on the size of a Z-matrix: the maximum number of variables and the maximum number of atoms within a calculation. These are set consistently for a maximum of 250,000 real atoms (including ghost but not dummy atoms), and a maximum of 250,000 Z-matrix centers (atoms, ghost atoms, and dummy atoms).

Using Internal Coordinates

Each line of a Z-matrix gives the internal coordinates for one of the atoms within the molecule. The most-used Z-matrix format uses the following syntax:

Element-label, atom 1, bond-length, atom 2, bond-angle, atom 3, dihedral-angle [,format-code]

Although these examples use commas to separate items within a line, any valid separator may be used. Element-label is a character string consisting of either the chemical symbol for the atom or its atomic number. If the elemental symbol is used, it may be optionally followed by other alphanumeric characters to create an identifying label for that atom. A common practice is to follow the element name with a secondary identifying integer: C1, C2, etc.

Atom1, atom2, atom3 are the labels for previously-specified atoms and are used to define the current atoms’ position. Alternatively, the other atoms’ line numbers within the molecule specification section may be used for the values of variables, where the charge and spin multiplicity line is line 0.

The position of the current atom is then specified by giving the length of the bond joining it to atom1, the angle formed by this bond and the bond joining atom1 and atom2, and the dihedral (torsion) angle formed by the plane containing atom1, atom2 and atom3 with the plane containing the current atom, atom1 and atom2. Note that bond angles must be in the range 0° < angle < 180°. Dihedral angles may take on any value.

The optional format-code parameter specifies the format of the Z-matrix input. For the syntax being described here, this code is always 0. This code is needed only when additional parameters follow the normal Z-matrix specification data, as in an ONIOM calculation.

As an initial example, consider hydrogen peroxide. A Z-matrix for this structure would be:

H
O 1 0.9
O 2 1.4 1 105.0
H 3 0.9 2 105.0 1 120.0

The first line of the Z-matrix simply specifies a hydrogen. The next line lists an oxygen atom and specifies the internuclear distance between it and the hydrogen as 0.9 Angstroms. The third line defines another oxygen with an O-O distance of 1.4 Angstroms (i.e., from atom 2, the other oxygen) and having an O-O-H angle (with atoms 2 and 1) of 105 degrees. The fourth and final line is the only one for which all three internal coordinates need be given. It defines the other hydrogen as bonded to the second oxygen with an H-O distance of 0.9 Angstroms, an H-O-O angle of 105 degrees and a H-O-O-H dihedral angle of 120 degrees.

Variables may be used to specify some or all of the values within the Z-matrix. Here is another version of the previous Z-matrix:

H
O 1 R1
O 2 R2 1 A
H 3 R1 2 A 1 D
  Variables:
R1 0.9
R2 1.4
A 105.0
D 120.0

Symmetry constraints on the molecule are reflected in the internal coordinates. The two H-O distances are specified by the same variable, as are the two H-O-O bond angles. When such a Z-matrix is used for a geometry optimization in internal coordinates (Opt=Z-matrix), the values of the variables will be optimized to locate the lowest energy structure. For a full optimization (FOpt), the variables are required to be linearly independent and include all degrees of freedom in the molecule. For a partial optimization (POpt), variables in a second section (often labeled Constants:) are held fixed in value while those in the first section are optimized:

  Variables:
R1 0.9
R2 1.4
A 105.0
  Constants:
D 120.0

See the examples in the discussion of the Opt keyword for more information about optimizations in internal coordinates.

Mixing Internal and Cartesian Coordinates

Cartesian coordinates are actually a special case of the Z-matrix, as in this example:

C   0.00   0.00   0.00
C   0.00   0.00   1.52
H   1.02   0.00  -0.39
H  -0.51  -0.88  -0.39
H  -0.51   0.88  -0.39
H  -1.02   0.00   1.92
H   0.51  -0.88   1.92
H   0.51   0.88   1.92

It is also possible to use both internal and Cartesian coordinates within the same Z-matrix, as in this example:

O 0 xo  0.  zo
C 0 0.  yc  0.
C 0 0. -yc  0.
N 0 xn  0.  0. 
H 2 r1 3 a1 1  b1
H 2 r2 3 a2 1  b2
H 3 r1 2 a1 1 -b1
H 3 r2 2 a2 1 -b2
H 4 r3 2 a3 3  d3 
  Variables: 
xo -1.
zo  0.
yc  1.
xn  1.
r1 1.08
r2 1.08
r3 1.02
a1 125.
a2 125.
d3 160.
b1  90.
b2 -90.

This Z-matrix has several features worth noting:

  • The variable names for the Cartesian coordinates are given symbolically in the same manner as for internal coordinate variables.

  • The integer 0 after the atomic symbol indicates symbolic Cartesian coordinates to follow.

  • Cartesian coordinates can be related by a sign change just as dihedral angles can.

Alternate Z-matrix Format

An alternative Z-matrix format allows nuclear positions to be specified using two bond angles rather than a bond angle and a dihedral angle. This is indicated by a 1 in an additional field following the second angle (this field defaults to 0, which indicates a dihedral angle as the third component):

C4 O1 0.9 C2 120.3 O2 180.0 0
C5 O1 1.0 C2 110.4 C4 105.4 1
C6 O1 R C2 A1 C3 A2 1

The first line uses a dihedral angle while the latter two use a second bond angle.

Using Dummy Atoms

This section will illustrate the use of dummy atoms within Z-matrices, which are represented by the pseudo atomic symbol X. The following example illustrates the use of a dummy atom to fix the three-fold axis in C3v ammonia:

N
X 1 1.
H 1 nh 2 hnx
H 1 nh 2 hnx 3  120.0
H 1 nh 2 hnx 3 -120.0
 
nh 1.0
hnx 70.0

The position of the dummy on the axis is irrelevant, and the distance 1.0 used could have been replaced by any other positive number. hnx is the angle between an N-H bond and the threefold axis.

Here is a Z-matrix for oxirane:

X
C1  X halfcc
O   X     ox C1 90.
C2  X halfcc  O 90. C1 180.0
H1 C1     ch  X hcc  O  hcco
H2 C1     ch  X hcc  O -hcco
H3 C2     ch  X hcc  O  hcco
H4 C2     ch  X hcc  O -hcco
 
halfcc   0.75
ox       1.0
ch       1.08
hcc    130.0
hcco   130.0

This example illustrates two points. First, a dummy atom is placed at the center of the C-C bond to help constrain the cco triangle to be isosceles. ox is then the perpendicular distance from O to the C-C bond, and the angles oxc are held at 90 degrees. Second, some of the entries in the Z-matrix are represented by the negative of the dihedral angle variable hcco.

The following examples illustrate the use of dummy atoms for specifying linear bonds. Geometry optimizations in internal coordinates are unable to handle bond angles of l80 degrees which occur in linear molecular fragments, such as acetylene or the C4 chain in butatriene. Difficulties may also be encountered in nearly linear situations such as ethynyl groups in asymmetrical molecules. These situations can be avoided by introducing dummy atoms along the angle bisector and using the half-angle as the variable or constant:

N
C 1 cn
X 2 1. 1 90.
H 2 ch 3 90. 1 180.
 
cn 1.20
ch 1.06

Similarly, in this Z-matrix intended for a geometry optimization, half represents half of the N-C-O angle which is expected to be close to linear. Note that a value of half less than 90 degrees corresponds to a cis arrangement:

N
C 1 cn
X 2 1. 1 half
O 2 co 3 half 1 180.0
H 4 oh 2  coh 3   0.0
 
cn 1.20
co 1.3
oh 1.0
half 80.0
coh 105.

Model Builder Geometry Specifications

The model builder is another facility within Gaussian for quickly specifying certain sorts of molecular systems [Pople67a]. It is requested with the ModelA or ModelB options to the Geom keyword, and it requires additional input in a separate section within the job file.

The basic input to the model builder is called a short formula matrix, a collection of lines, each of which defines an atom (by atomic symbol) and its connectivity, by up to six more entries. Each of these can be either an integer, which is the number of the line defining another explicitly specified atom to which the current atom is bonded, or an atomic symbol (e.g. H, F) to which the current atom is connected by a terminal bond, or a symbol for a terminal functional group which is bonded to the current atom. The functional groups currently available are OH, NH2, Me, Et, NPr, IPr, NBu, IBu, and TBu.

The short formula matrix also implicitly defines the rotational geometry about each bond in the following manner. Suppose atoms X and Y are explicitly specified. Then X will appear in row Y and Y will appear in row X. Let I be the atom to the right of X in row Y and J be the atom to the right of Y in row X. Then atoms I and J are put in the trans orientation about the X-Y bond. The short formula matrix may be followed by optional lines modifying the generated structure. There are zero or more of each of the following lines, which must be grouped together in the order given here:

AtomGeom,I,Geom
Normally the local geometry about an atom is defined by the number and types of bond about the atom (e.g., carbon in methane is tetrahedral, in ethylene is trigonal, etc.). All bond angles at one center must be are equal. The AtomGeom line changes the value of the bonds at center I. Geom may be the angle as a floating point number, or one of the strings Tetr, Pyra, Trig, Bent, or Line.

BondRot,I,J,K,L,Geom
This changes the orientations of the I-J and K-L bonds about the J-K bond. Geom is either the dihedral angle or one of the strings Cis (≥0), Trans (≥180), Gaup (≥+60), or Gaum (≥–60).

BondLen,I,J,NewLen
This sets the length of the I-J bond to NewLen (a floating point value).

The model builder can only build structures with atoms in their normal valencies. If a radical is desired, its extra valence can be effectively “tied down” using dummy atoms, which are specified by a minus sign before the atomic symbol (e.g., –H). Only terminal atoms can be dummy atoms.

The two available models (A and B) differ in that model A takes into account the type (single, double, triple, etc.) of a bond in assigning bond lengths, while model B bond lengths depend only on the types of the atoms involved. Model B is available for all atoms from H to Cl except He and Ne. If Model A is requested and an atom is used for which no Model A bond length is defined, the appropriate Model B bond length is used instead.

 


Last update:4 October 2013