PMC:4264129 / 21409-66736
Annnotations
2_test
{"project":"2_test","denotations":[{"id":"25353178-24694084-52044411","span":{"begin":745,"end":747},"obj":"24694084"},{"id":"25353178-24758690-52044412","span":{"begin":911,"end":913},"obj":"24758690"},{"id":"25353178-19994881-52044413","span":{"begin":1317,"end":1319},"obj":"19994881"},{"id":"25353178-16955487-52044414","span":{"begin":2970,"end":2972},"obj":"16955487"},{"id":"25353178-20649313-52044415","span":{"begin":4748,"end":4750},"obj":"20649313"},{"id":"25353178-21786796-52044416","span":{"begin":6538,"end":6540},"obj":"21786796"},{"id":"25353178-22957541-52044417","span":{"begin":7739,"end":7741},"obj":"22957541"},{"id":"25353178-22066681-52044418","span":{"begin":12414,"end":12416},"obj":"22066681"},{"id":"25353178-12666158-52044419","span":{"begin":14038,"end":14040},"obj":"12666158"},{"id":"25353178-18512970-52044420","span":{"begin":14044,"end":14046},"obj":"18512970"},{"id":"25353178-19366259-52044421","span":{"begin":14047,"end":14049},"obj":"19366259"},{"id":"25353178-11849023-52044422","span":{"begin":14326,"end":14328},"obj":"11849023"},{"id":"25353178-11749344-52044423","span":{"begin":14329,"end":14331},"obj":"11749344"},{"id":"25353178-16092826-52044424","span":{"begin":14332,"end":14334},"obj":"16092826"},{"id":"25353178-16955487-52044425","span":{"begin":15078,"end":15080},"obj":"16955487"},{"id":"25353178-23480798-52044426","span":{"begin":15403,"end":15405},"obj":"23480798"},{"id":"25353178-22731938-52044427","span":{"begin":15406,"end":15408},"obj":"22731938"},{"id":"25353178-23480798-52044428","span":{"begin":15824,"end":15826},"obj":"23480798"},{"id":"25353178-22731938-52044429","span":{"begin":15827,"end":15829},"obj":"22731938"},{"id":"25353178-11389651-52044430","span":{"begin":17850,"end":17852},"obj":"11389651"},{"id":"25353178-17001413-52044431","span":{"begin":17853,"end":17855},"obj":"17001413"},{"id":"25353178-17690772-52044432","span":{"begin":17856,"end":17858},"obj":"17690772"},{"id":"25353178-19099446-52044433","span":{"begin":17871,"end":17873},"obj":"19099446"},{"id":"25353178-20336250-52044434","span":{"begin":17874,"end":17876},"obj":"20336250"},{"id":"25353178-24691775-52044435","span":{"begin":17985,"end":17987},"obj":"24691775"},{"id":"25353178-19787906-52044436","span":{"begin":17994,"end":17996},"obj":"19787906"},{"id":"25353178-23480798-52044437","span":{"begin":18030,"end":18032},"obj":"23480798"},{"id":"25353178-22731938-52044438","span":{"begin":18033,"end":18035},"obj":"22731938"},{"id":"25353178-23902476-52044439","span":{"begin":19585,"end":19587},"obj":"23902476"},{"id":"25353178-23480798-52044440","span":{"begin":21114,"end":21116},"obj":"23480798"},{"id":"25353178-2660832-52044441","span":{"begin":22268,"end":22270},"obj":"2660832"},{"id":"25353178-24889800-52044442","span":{"begin":25901,"end":25903},"obj":"24889800"},{"id":"25353178-22977886-52044443","span":{"begin":26539,"end":26541},"obj":"22977886"},{"id":"25353178-14634989-52044444","span":{"begin":36418,"end":36421},"obj":"14634989"},{"id":"25353178-15224394-52044445","span":{"begin":36422,"end":36425},"obj":"15224394"},{"id":"25353178-20136072-52044446","span":{"begin":36766,"end":36769},"obj":"20136072"},{"id":"25353178-18190187-52044447","span":{"begin":36830,"end":36833},"obj":"18190187"},{"id":"25353178-10032153-52044448","span":{"begin":37066,"end":37069},"obj":"10032153"},{"id":"25353178-17690778-52044449","span":{"begin":40070,"end":40073},"obj":"17690778"},{"id":"25353178-19344174-52044450","span":{"begin":40514,"end":40517},"obj":"19344174"},{"id":"25353178-21090628-52044451","span":{"begin":42180,"end":42183},"obj":"21090628"},{"id":"25353178-22510106-52044452","span":{"begin":42367,"end":42370},"obj":"22510106"},{"id":"25353178-22510106-52044453","span":{"begin":43126,"end":43129},"obj":"22510106"},{"id":"T94000","span":{"begin":745,"end":747},"obj":"24694084"},{"id":"T36350","span":{"begin":911,"end":913},"obj":"24758690"},{"id":"T74369","span":{"begin":1317,"end":1319},"obj":"19994881"},{"id":"T70397","span":{"begin":2970,"end":2972},"obj":"16955487"},{"id":"T39245","span":{"begin":4748,"end":4750},"obj":"20649313"},{"id":"T30423","span":{"begin":6538,"end":6540},"obj":"21786796"},{"id":"T45873","span":{"begin":7739,"end":7741},"obj":"22957541"},{"id":"T88980","span":{"begin":12414,"end":12416},"obj":"22066681"},{"id":"T34137","span":{"begin":14038,"end":14040},"obj":"12666158"},{"id":"T68207","span":{"begin":14044,"end":14046},"obj":"18512970"},{"id":"T34756","span":{"begin":14047,"end":14049},"obj":"19366259"},{"id":"T27515","span":{"begin":14326,"end":14328},"obj":"11849023"},{"id":"T35936","span":{"begin":14329,"end":14331},"obj":"11749344"},{"id":"T85148","span":{"begin":14332,"end":14334},"obj":"16092826"},{"id":"T10148","span":{"begin":15078,"end":15080},"obj":"16955487"},{"id":"T95716","span":{"begin":15403,"end":15405},"obj":"23480798"},{"id":"T74391","span":{"begin":15406,"end":15408},"obj":"22731938"},{"id":"T64622","span":{"begin":15824,"end":15826},"obj":"23480798"},{"id":"T76855","span":{"begin":15827,"end":15829},"obj":"22731938"},{"id":"T71646","span":{"begin":17850,"end":17852},"obj":"11389651"},{"id":"T98212","span":{"begin":17853,"end":17855},"obj":"17001413"},{"id":"T66731","span":{"begin":17856,"end":17858},"obj":"17690772"},{"id":"T79755","span":{"begin":17871,"end":17873},"obj":"19099446"},{"id":"T90748","span":{"begin":17874,"end":17876},"obj":"20336250"},{"id":"T99869","span":{"begin":17985,"end":17987},"obj":"24691775"},{"id":"T88854","span":{"begin":17994,"end":17996},"obj":"19787906"},{"id":"T46350","span":{"begin":18030,"end":18032},"obj":"23480798"},{"id":"T71841","span":{"begin":18033,"end":18035},"obj":"22731938"},{"id":"T18212","span":{"begin":19585,"end":19587},"obj":"23902476"},{"id":"T74184","span":{"begin":21114,"end":21116},"obj":"23480798"},{"id":"T15269","span":{"begin":22268,"end":22270},"obj":"2660832"},{"id":"T34853","span":{"begin":25901,"end":25903},"obj":"24889800"},{"id":"T56576","span":{"begin":26539,"end":26541},"obj":"22977886"},{"id":"T85984","span":{"begin":36418,"end":36421},"obj":"14634989"},{"id":"T49409","span":{"begin":36422,"end":36425},"obj":"15224394"},{"id":"T92061","span":{"begin":36766,"end":36769},"obj":"20136072"},{"id":"T30576","span":{"begin":36830,"end":36833},"obj":"18190187"},{"id":"T54844","span":{"begin":37066,"end":37069},"obj":"10032153"},{"id":"T10244","span":{"begin":40070,"end":40073},"obj":"17690778"},{"id":"T99573","span":{"begin":40514,"end":40517},"obj":"19344174"},{"id":"T52305","span":{"begin":42180,"end":42183},"obj":"21090628"},{"id":"T49361","span":{"begin":42367,"end":42370},"obj":"22510106"},{"id":"T83679","span":{"begin":43126,"end":43129},"obj":"22510106"}],"text":"2. Methodology\n\n2.1. Experimental Methods\nAs discussed above, the existence of an intramolecular H-bond was accepted in this review on the basis of gas-phase experiments. In a number of the studies the spectra were recorded for jet-cooled systems. References to these are provided when the gas-phase and in-solution molecular structures are compared in Section 3. The jet-cooling technique [23] allows reaching local temperatures as low as about 5 K, with the advantage that the molecules assume their vibrational ground states. Under these conditions, the vibrational spectrum becomes simpler and different conformers can be more easily identified and characterized. Microwave spectra were recorded at room temperature or somewhat below [24,25,26,27]. Gas-phase electron diffraction structure determinations, sometimes at two or three different temperatures, were performed in the range of 297–733 K [28,29,30,31].\nFor in-solution IR spectra, the temperature was generally room temperature or not far from it in the experiments surveyed here. In these cases, the boiling point of the solvent imposes a limit for the upper temperature. It is known that signals can split in NMR studies by lowering the temperature. The lowest operational temperature is constrained, however, by the freezing point of the NMR solvent [32].\nIn summary, the gas-phase structural parameters were obtained from experiments conducted in a very large temperature range of 5–733 K. The in-solution investigations were mainly performed near room temperature. Thus, comparisons of the structural data between gas-phase and in-solution experiments, as well as to theoretical calculations referring to 0 K, need caution.\n\n2.2. Geometry Optimization\n\n2.2.1. General Problems\nThe first step in a quantum-chemical structure and energy analysis is the optimization of the molecular geometry. If more than one structural form (different conformers, tautomers) are to be considered, each of them has to be optimized. A very important point is the selection of a reliable theoretical method and the application of a satisfactorily large basis set. Clearly, one wants to obtain the best computational results possible within the technical limits of the given structural problem. When a seeming H-bonded system is under scrutiny, an additional problem emerges in that the system has to be identified whether it is really a hydrogen-bonding arrangement or not.\nIn the review [22], a reference list was provided for the theoretical approaches and basis sets most frequently used in the past 15–20 years for geometry optimizations and relative energy calculations for ground state, closed-shell systems. Geometries were mainly optimized at the ab initio Hartree-Fock (HF) and second-order Møller-Plesset perturbation theory (MP2) levels or using some DFT-based (density functional theory) method such as B3LYP or some more recent ones that account for the dispersion interaction like the B97D method of Grimme [33] or the M05 and M06 methods of Zhao and Truhlar [34]. For basis sets, the 6-311++G** Pople basis or correlation consistent basis sets like cc-pvXz or aug-cc-pvXz (X = d, t, q) [35] have been applied more often in the most recent studies. Relative energies of conformers and tautomers from ab initio calculations are sensitive to the applied level of theory (method + basis set). Accounting for the electron correlations beyond the MP2 approach has turned out to be very effective by the application of the coupled-cluster methods, CCSD and CCSD(T) (coupled-cluster singles and doubles and noniterative triples) [36,37]. Hobza proposed an extrapolation formula for calculating the molecular energy at the CBS (complete basis set level) utilizing the MP2 limit energy and the difference of the CCSD or CCSD(T) energy and the MP2 energy calculated with some smaller basis set [38]. Frequency analysis can verify local energy minimum geometries by finding all positive vibrational frequencies. Using classical partition functions for ideal-gas molecules, the free energies can be estimated at some temperature T and pressure p [39].\nWhile many papers have proven that the results are very sensitive to the level of theory, calculations applying high-level theoretical methods in combination with large basis sets may not be practical even for the case of small molecules. This is true not only for individual geometric parameters and energy values where it is normal that energy decreases with a higher-level method and/or a larger basis set, but even for the relative energies between conformations. Changes in the relative values suggest that the energy differences have not reached a converged limit value yet. A disappointing example was presented by DePrince and Mazziotti [40], who compared two tautomers of the CH3NO molecule at the CCSD and CCSD(T) levels. Whereas the relative energies were calculated at 29.1 and 21.4 kJ/mol, respectively, utilizing the cc-pvdz basis set, the CBS values are 2.8 and −6.2 kJ/mol. These changes are dramatic. They indicate that the basis set effect on the relative energy is very large and the selected methods at the CBS limit even predict different relative stabilities. The CCSD(T)/CBS result is perhaps more reliable, yet it can not be ascertained in the absence of experimental information whether the obtained value is an acceptable limit or even higher-level methods should be considered. Note that this problem can be noticed even for a very small molecule. For a larger molecule (e.g., 15–20 heavy atoms and corresponding hydrogens) upgrading the level of theory is even less practical. The situation could be worse regarding the optimized geometries. For relative energies, we can ascertain at least that the computed limit is questionable while for geometries there is no clue about the correct bond lengths, angles, and torsion angles. If they do not vary monotonically in parallel with the increasing level of theory, one may have even less idea about the correct limit values in the absence of experimental information.\nWhy are the above, otherwise well-known computational experiences important in a review regarding H-bonds? As was shown above, the critical point can emerge after a small distortion of the optimized 1,2-dihydroxybenzene geometry. This finding can be interpreted to mean that the H-bond is disrupted in the optimal geometry. Indeed, it is quite possible that the existence of the BCP is very sensitive to the structure as can be seen by the earlier the quoted notes of Contreras-García et al. [11]. Thus, the suspicion may emerge that the level of theory is not high enough when small geometric changes can create or perturb a H-bond. Using again the paper of Mandado et al., [7] as an example, the (3, −1) BCP was found on the B3LYP/6-31+G** density map, but the BCP disappeared at the B3LYP/6-311++G** level, and did not appear either when the B3LYP/6-311++G (3d,3p) density was studied. The authors considered the B3LYP/6-31+G** result as an artifact and attributed it to the lack of diffuse functions on the hydrogen atoms. Disappearance of the BCP with larger basis sets clearly indicates, however, the basis set effect on some calculated topological indeces, and calls for studying a reliable electron density map. Obtainment of the latter is perhaps possible at a very high theoretical level, but such calculations are not practical for larger molecules. On the other hand, if gradually increasing basis sets provide different predictions with respect to the existence of a BCP, then conclusions based on medium-size basis sets remain uncertain.\nTheoretical calculations are able to predict a shift in the X–H vibrational frequency if the bond is involved H-bond interaction. Gu et al., [41] studied the possible intramolecular H-bond for α-hydroxy acetic acid. The authors concluded that the red-shift of about 100 cm−1, based on former experimental results for the stretching frequency of the α-hydroxy group relative to that for a free O–H bond in methanol “can be attributed to internal OH…O= hydrogen bonding”.\nThe red-shift in the case of an intramolecular H-bond was also demonstrated, at least qualitatively, by HF/6-31G* calculations for the 1,2-ethanediol [42,43]. For the all-trans-OCCO conformer tTt (C2h symmetry, for the three-letter code see [44]), where the two oxygens are far from each other, the two O–H frequencies were calculated equally at 4124 cm−1. For the most stable OCCO gauche conformation, tG+g− (C1 symmetry) allowing for an intramolecular hydrogen bond, the two O–H frequencies differed in accord with the experimental finding (see below). The frequencies for the gauche form were calculated at 4095 and 4123 cm−1. The smaller value refers to the O–H vibration involved in intramolecular O–H…O bonding. The larger frequency is related to the free OH vibration in the tG+g− conformation, where “t” indicates the trans HOCC arrangement. As a free OH, its stretching frequency is practically not affected and is equal to that for the tTt conformer. The O…O and O…H distances are 277 and 236 pm, respectively, well within the structural parameter set accepted for a H-bond. Although the calculated high frequencies are generally overestimated by about 10% at the HF level in comparison with experimental values, the shift of the frequency for the O–H group involved in the intramolecular interaction has revealed. A similar conclusion can be drawn for the 2-OH benzoic acid, when the calculated phenolic O–H frequency in intramolecular interaction with the carbonyl oxygen is compared with the free OH stretching frequency, as 3952 vs. 4112 cm−1 [45].\nFlorio et al., [46] compared the OH stretching frequencies of the monomeric and dimeric forms of formic and benzoic acids. The experimental values showed a red-shift of 459 cm−1 upon formic acid dimerization. The calculated harmonic frequency differences were 556 and 435 cm−1 at the B3LYP/aug-cc-pvtz and MP2/aug-cc-pvtz levels, respectively. For formic acid and its dimer, the geometry optimizations by the two methods led to very similar structure parameters, generally also close to the experimental values. The predicted frequencies at the MP2/aug-cc-pvtz level showed consistent overestimations for the monomer and the formic acid dimer (FAD), resulting in a red-shift close to the experimentally observed value. B3LYP/aug-cc-pvtz calculations provided, however, a larger overestimation for the monomer than for the dimer, leading to an increased red-shift. The authors concluded that the results “provide strong evidence that the B3LYP method does not provide a quantitatively correct description of this aspect of the H-bonding in the FAD dimer”.\nUpon the benzoic acid dimerization, the red-shift was 217 cm−1 experimentally as compared with the theoretical value of 616 cm−1 calculated at the B3LYP/6-311+G(2d,2p)/B3LYP/6-31+G(d) level. In this case the red-shift was even more strongly overestimated than that for FAD (see above) by the B3LYP method. These calculations utilized, however, a smaller basis set, B3LYP/6-311+G(2d,2p) for the carboxylic group and 6-31+G(d) for the atoms of the phenyl rings. For the benzoic acid systems, the calculated frequency was overestimated for the monomer and underestimated for the dimer. This latter result differs from that for the formic acid dimer. The calculated large red-shift for the benzoic acid systems may be explained by the interplay of the method and basis set. Since it was already qualified by the authors that the method does not quantitatively describe the H-bond for the dimer of a simple carboxylic acid, this likely applies to the benzoic acid dimer as well.\nIn intramolecular hydrogen bonds, the geometry for both the H…Y distance and the X–H…Y angle is primarily determined by the covalent structure of the molecule. While three-atom hydrogen-bonded rings are extremely rare, the four-atom substructures (e.g., carboxylic group, amides) deserve special consideration. In most cases, a H-bond can be expected if the system can form a five to seven-member ring, including arrangements utilizing a polar H. Prototypes are indicated in Figure 1 and typical representatives of five- and six-member rings are shown Figure 2 and Figure 3. Seven-member rings can be formed for γ-substituted carboxylic acids, 1,4-disubstituted butanes with OH and/or NH2 substituents.\nLarger rings are probably not stable. Chen et al., [47] pointed out that no intramolecular H-bond exists in the prevailing conformer of 1,5-pentadiol and 1,6-hexadiol at room temperature. In these cases, formation of an intramolecular H-bond would require a ring conformation with eight and nine members, respectively. This is probably unfavorable due to entropy considerations even for seven-member rings. Nagy et al., [48] studied different conformers for γ-hydroxy-butyric acid. Although the lowest-energy conformer optimized at the MP2/6-311++G** level formed a seven-member ring with an O–H…O= H-bond, the free energy for this structure is higher by about 2 kJ/mol than that for the most stable gas-phase species where this bond is disrupted, as also found experimentally [49]. The results for the increased relative free energy suggest unfavorable entropy effects for the hydrogen-bonded seven-member ring. This explanation is supported by the argument of Blanco et al. [50], who investigated the gas-phase structure of γ-amino-butyric acid (GABA). Intramolecular H-bonds were noticed in both forms of N–H…O=C and N…H–O–C=O, although the two mostly populated species do not possess an intramolecular H-bond. In order to create such bonds, structures have to be formed which “contribute to decrease entropy and to increase the Gibbs energy” [50].\n\n2.2.2. Special Problems\nIn general, for the past twenty years optimized molecular geometries in solution have been obtained by applying a continuum solvent model. The idea was introduced and subsequently developed by the Tomasi group as the PCM model [51,52]. Since the 1990’s, different continuum solvent models [53,54,55,56,57,58,59] and extension beyond the dielectric approximation [60] have been developed to account for the solvent effects on the geometry and energy/free energy of dissolved molecules. Several reviews summarize these models and compare the results obtained from different approaches [52,61,62,63].\nThe basic idea in the widely used PCM method [51] is that the solute is placed in a cavity carved in the continuum dielectric solvent, and the solute and the solvent mutually polarize each other. As a consequence, the solute’s geometry changes slightly and its internal energy increases when compared to its optimized gas-phase energy. The energy-increase is balanced by the developing solute-solvent electrostatic interaction energy. The final results are obtained through an iterative self-consistent-field (SCF) process that finds the total energy minimum and its related geometry. For the geometry optimization and energy/free energy calculations, all methods can be utilized, which were mentioned in relation to gas-phase calculations [33,34,35,36,37,38]. Thus, geometry optimizations can be performed by means of the HF, MP2 and DFT methods, and higher level energy calculations can be performed up to the CCSD(T)/CBS level. The customary basis set for geometry optimization and frequency analysis is 6-311++G**, but even the aug-cc-pvtz set has been applied [15,64].\nWhen a molecule dissolves, a close molecular environment is encountered that is in contrast to the most frequently applied ideal-gas model, where no potential energy interaction is considered even through the collisions of the molecules. Although the solute-solvent interactions are substantial, the effect of a non-polar or only slightly polar solvent (CCl4, CHCl3) on the molecular geometry is generally small [15,64]. The geometric effect could be, however, large when a solute with an intramolecular H-bond in the gas phase dissolves in a protic solvent such as water or methanol, which have both proton donor and acceptor sites. In this case, the X–H…Y intramolecular H-bond may collapse while solute-solvent H-bonds are formed using the free XH and Y sites.\nThe weakest point of the continuum dielectric solvent model is that the above solute-solvent H-bond(s) are only implicitly mimicked by polarization of the solvent and concomitant appearance of surface charges on the inner surface of the cavity: Negative surface charges opposite to a polar hydrogen and positive ones in the lone-pair regions of the solute’s oxygens and nitrogens. Although this response is qualitatively correct, the calculated solute-solvent stabilization energy is underestimated [65,66]. Thus, for proper calculation of the free energy changes when a polar solute with or without internal H-bond(s) dissolves in a protic solvent, explicit consideration of the solute-solvent intermolecular H-bonds becomes necessary.\nThis requirement can be largely satisfied by adopting the supermolecule + continuum approach, where the solute is surrounded by a number of explicit solvent molecules. The solute and the explicit solvent molecules mimic the H-bonds in the first solvation shell within the cavity carved in the continuum solvent. The critical question then becomes, how many explicit solvent molecules are to be considered.\nFor constructing the starting geometry of a supermolecule, knowledge of microsolvated solute structures is very helpful. In these systems, the central, polar molecule with or without an intramolecular H-bond is solvated by a few solvent molecules. Locations of the solvent molecules (water, methanol) indicate the most preferable solvation sites of the solute with a hydrogen donor/acceptor solvent. Useful information can be obtained from experimental gas-phase hydration/solvation studies augmented with theoretical calculations [12,13,14,16,23,67,68,69,70] or specific theoretical calculations for hydrated amino acid side chains, nucleotid base and sugar models [71,72,73,74,75].\nRecent calculations proved [15,64] that application of at least the aug-cc-pvtz basis set is required for reliable estimation of the relative solute free energies. If the solute has 6–10 C, N, O atoms and connected hydrogens, 500 basis functions could easily be required. If such a solute has to be surrounded by at least 5–6 water molecules, the number of basis functions increases to about 1000. The number of basis functions could be somewhat reduced by considering the solvent molecules with a lower basis set, with, e.g., 6-31+G**. While the supermolecule + continuum approach can be useful theoretically, it suffers from several technical challenges.\n(1) The geometry optimization for a system with 500–1000 basis functions is very slow in solution. If one wants to prove the local-energy-minimum character of the supermolecule and calculate thermal corrections, very small remaining forces should be allowed only at the end of the optimization. It is almost unreachable for a number of systems (or only by the application of the very time-consuming analytical second-derivative methods), in cases when torsion or intermolecular vibration frequencies could be as low as a few cm−1.\n(2) The number of explicit solvent molecules to be considered can become be critical. In a real, dilute solution the solute is surrounded by solvent molecules all around. Except for the simplest modeling cases like the partial solvation of γ-amino-butyric acid with 2–5 water molecules [76] or consideration of 3–8 water molecules during the HOCl catalyzed tautomerization of β-cyclopentadione [77], a considerably larger number of water molecules is generally required for reasonable modeling of the solvation sphere even for a small organic molecule. An impressive example was provided by Lu [78], who optimized the geometry of the Al(H2O)63+·12 H2O hydrate at the B3LYP/6-31+G(d,p) level in a water continuum by the PCM method. The resulting structure was of nearly spherical symmetry, easily allowing for the formation of the water network. Consideration of eighteen solvent molecules was necessary for mimicking the first and second hydration shells in a dilute solution.\n(3) The results of Lu and coworkers call attention to the need for the supermolecule to reasonably mimic the immediate in-solution environment of the solute. With a relatively small number of explicit solvent molecules within the supermolecule (for example, 3–4 water molecules, originally each of them facing a polar site), the water-water interactions may dominate over the solute-water interactions. Instead of forming 3–4 solute-water hydrogen bonds, a water cluster is then formed on some side of the solute and the number of water-solute hydrogen bonds would be smaller than expected in a water box with hundreds of water molecules. A successful tetrahydrate model in a continuum solvent was developed by Nagy for the syn-anti transformation of the acetic acid carboxylic group [79], whereas three waters in hydrogen bonds to the solute were not enough for modeling the immediate solvation environment of the transition state for 2F-phenol [15].\n(4) In general, only the first solvation shell around the polar sites can be modeled. Moreover, even in these cases, the explicit-solvent/continuum interface suffers from neglecting the consideration of the solvent-solvent hydrogen bonds. For methanol or acetonitrile solvents, the problem is not dramatic since the polar site of the solvent molecules should point toward the solute while the methyl group is located mainly on the outer surface of the supermolecule. Then the first-sphere solvent molecules can create a non-polar surface toward its continuum representation. This is surely not the case for explicit water molecules and is likewise questionable for a solvent like acetic acid with two stericly separated polar sites.\n(5) Geometry optimization for a supermolecule leads to an overly ordered structure, which is not maintained due to thermal disordering in a real solution.\n(6) If one wants to study the structure of a dilute or moderately concentrated (1 molar) solution as well as solute dimerization, boxes of a large number of explicit solvent molecules should be considered. These studies typically then require Monte Carlo (MC) [80] or molecular dynamics (MD) [81] simulations.\nDuring MC calculations, the solution model is a large solvent box with hundreds of solvent molecules and one or a few solute molecules embedded in the solvent. Atoms are represented by point-like centers characterized with atomic charges and assigned van der Waals parameters. The interaction energy of the atoms in different molecules is calculated by pair potentials and the total energy is the sum of these pair-interaction energies. Macroscopic thermodynamic quantities are estimated by averaging the individual values calculated for a large number of consecutive configurations. A configuration means a specific geometric arrangement of the elements in the solution box. The method is a probability method, meaning that a new configuration with modified geometric arrangement of the elements is considered for the above averaging upon the probability of the acceptance of the total energy change. If the new configuration is rejected, the old one is considered one more time in the averaging process. The most frequently applied sampling procedures are the Metropolis procedure [82] or some suitable procedure [83,84], which can accelerate the convergence of the calculated averages for thermodynamic quantities or help to more quickly reach an equilibrium solution structure by applying a biased energy calculation and probability for the acceptance of a new configuration. The goal is to generate a series of configurations corresponding to the Boltzmann distribution. For constant temperature simulations, the temperature is a parameter of the expression determining the acceptance probability. After having generated the required set of configurations, the average energy, enthalpy, volume, etc., can be calculated as an arithmetic mean of the individual values obtained with each configuration. If a biasing sampling was used, the probability of the acceptance has to be corrected before calculating averages.\nThe MD simulation is a deterministic process. The solution box is established as described above, but the atomic masses are also considered. A force field is used for calculating the total energy of the system with a given geometric arrangement (with Cartesian coordinates for each atom) at a reference time “t”. The force field contains terms accounting for the energy contributions by atoms bound along a 1–2–3–4 path, as well as for interactions of more remote atoms within the molecule and with all atoms in other molecules. The system generally is not in energy minimum, thus there are forces acting on the atoms. Using the gradient of the total energy, the forces acting on each atom can be determined. Applying Newton’s law, the position of the atoms at t + Δt can be calculated by means of the determined instantaneous velocities. Δt must be small, generally being chosen between 0.5 and 2 fs. In the latter case, the X–H distances are kept at a constant value. The temperature is related to the sum of the atomic kinetic energies. The simplest way to keep the temperature at a constant value is by scaling the determined atomic velocities or by coupling a thermostat to the system. Letting the simulation run long enough, sometimes for tens of nanoseconds, the average of the thermodynamic quantities can be obtained for a simulation period, where some structural characteristics, e.g., the solution density, have reached an equilibrium value already. By examining the trajectories calculated for geometric parameters of the solute, structural changes can be followed.\nUsing intermolecular pair-potentials such as OPLS-AA [85,86], Amber [87] or CHARMM22 [88], the largest problem is the development of the relevant atomic charges for the molecule. For example, Amber was originally parameterized for biopolymers and DNA, and no special charge parameters were available for, e.g., the HO–CHx–CHx–Y (Y = OH, NH2, NH3+, x = 1, 2) substructures. Furthermore, the OH and Y charges (and the CHx values, as well) should be conformation dependent, since, e.g., there is an intramolecular H-bond for the tG+g− conformer of 1,2-ethanediol, which is missing in the tTt form (Figure 2). Also, conformation dependent charges have to be used with Y = NH2, and NH3+ for the OCCN gauche and trans structures [89]. Recent developers of force fields suggest using molecular electrostatic potential (MEP) fitted charges, where the MEP should be obtained for the in-solution optimized solute. Since the solute and the continuum solvent mutually polarize each other, the MEP obtained at the end of the SCF procedure for the geometry optimization and energy minimization, reflects the electrostatic potential of a polarized solute in a polarized solvent environment. Charge fitting is a working tool, although different fitting methods (ES [90], RESP, [91], CHELPG [92]) lead to different results. Nonetheless, the problem is again of how to optimize the solute in a solvent environment. The MEP for the supermolecule is not relevant, since the solvent molecules are not so strictly bound to the solute due to thermal disorientation in a large solvent box as would follow from the structure of an optimized supermolecule. Furthermore, there is generally some charge transfer between the elements of the supermolecule. Although the programs force the charge of the total supermolecule to zero or some +/− integer for ionic solutes, the individual charges for the solute and the surrounding solvent molecules generally differ from 0, +/−1 etc. Thus, the MEP- fitted charges for a supermolecule should not be directly accepted for the atoms of the solute and applied in the MC of MD intermolecular pair-potentials. A possibility is that the solute geometry is accepted from the optimized supermolecule, and the MEP is fitted for the pure solute in a single-point calculation. This is, however, not a consistent procedure. The author has not found a good solution for this problem when surveying the literature.\nDespite the listed potential problems, the continuum solvent approach has been one of the most frequently used theoretical methods for characterizing the geometry and the energy/free energy for dissolved molecules. Since chemical equilibria depend on relative rather than absolute free energies, the problems mentioned above may not emerge in every case so harshly, and the errors could be partially cancelled. For example, the energy minimization in the supermolecule approach leads to too tightly bound water molecules. In a model, where the thermal disordering effect is also taken into consideration (MC and MD), a more loosely bound first solvation shell is expected. Nonetheless, since relative energy data are to be compared for the supermolecules with different solute conformations, the error is probably decreased. Also a more or less cancelled error may be expected regarding the interaction energy between the explicit solvent molecules at the outer surface of the supermolecules and the continuum. In a study for a series of compounds, it is a good practice to compare the computational results with available experimental values. Unfortunately, however, good-quality experimental results generally not are available in the literature for equilibria, where a number of conformers have been detected in solution.\n\n2.3. Free Energy Calculations\nThe focus of this subsection is the free energy calculation for explicit solvent models. The continuum-solvent calculations characterized above can provide free energies for individual solutes in any conformation, protonation state and tautomeric form. The method is not well suited, however, to the problem under investigation in the present survey pertaining to the possible disruption of intramolecular H-bonds in protic solvents. Since calculation of relative free energies is satisfactory for finding the more stable conformation, a perturbation method can be utilized in simulations where large solvent boxes including a solute and explicit solvent molecules are considered. The perturbation procedure is based on the work of Zwanzig and Jorgensen [93,94] and is widely used as the “free energy perturbation (FEP) method”.\nWhen the FEP method is applied, the atoms of solute and solvent molecules are characterized as sets of point charges with assigned van der Waals parameters. Locations of the solute charges symbolize those for atomic nuclei in the molecule. The combined effect of the accepted net solute charges should reproduce the in-solution molecular electrostatic potential nearly within and out of the van der Waals surface. To achieve this, it is useful to obtain the values of the point charges and determine their locations for the involved conformers/tautomers of the solute by geometry optimizations followed by the fit of charges to the in-solution MEP. In the applications of FEP below, the geometry of the solute with an intramolecular H-bond is converted into another structure without this bond. The atomic charges are also converted gradually and simultaneously from the starting set to final set, characterizing the corresponding conformations. The free energy is a state function, thus even if the intermediate states do not exist physically (when, e.g., the proton annihilation/proton development path is traveled) the sum of the perturbed free energy increments are theoretically correct. The molar free energy increment in the “i”th step, ΔGi is calculated as −RT ln[exp(−(Epi − Eri)/RT)]av, where the average of the exponential expression has to be calculated through a long simulation. Eri and Epi and are the total energies of the system per mole in a given configuration with parameters applied for the reference and perturbed solutes, respectively. The FEP in this case is led through controlled conformations or states. The constancy of the charge and geometry parameters is maintained only for a perturbation step. The actual parameters for the reference and perturbed states can be determined by a linear transformation of their values between the starting and the end structures. For example, conformations with and without an intramolecular H-bond correspond to perturbation parameters λ = 0 and λ = 1, respectively. In order to keep the free energy increments below about 4 kJ/mol, Δλ may be as small as about one hundredth.\nThe models used for water as solvent are generally TIP3P or TIP4P [20]. Some rigid, united atom CHn (n = 1–3) models also are available for small organic solvents such as CCl4, CHCl3, CH2Cl2, CH3CN, CH3OH [95] when the BOSS program (Biochemical and Organic Simulation System) [96] is used for Monte Carlo simulations [97,98,99]. The solvent models were parameterized for producing good density and heat of vaporization for the neat solvent, thus the accepted model implicitly accounts for the solvent-solvent interactions, including mutual polarizations of these molecules. Recently, however, one can create solvent boxes with all-atom solvent models by using the OPLS-AA 12–6–1 force field parameters [85,86].\nFigure 4 The free energy perturbation (FEP) curves for the transformations of conformers with an intramolecular H-bond to structures without H-bonds. Shown are 1,2-ethanediol (1) to (2), salicylic acid (3) to (4), and β-alanine zwitterion (5) to (6) where structure numbering is taken from Figure 2. Figure 4 is an illustration of the course of three FEP curves for conformational changes of the same molecules schematically compared in Figure 2. It is clear that the FEP, provided as the percent transformation, is not necessarily monotonic. For the two neutral molecules the most favorable hydration can be expected at about 20% and 60% transformations between the starting and finishing local energy minimum structures. For the gauche zwitterionic β-alanine, the ionic sites are less open for hydration and the solvation free energy the less favorable (5 is the trans, 6 is the gauche conformation).\nIn MD simulations, the solute geometry can change under the forces acting on each solute atom. Determination of bond stretching, bond bending and torsion parameters for every special molecule with possible intramolecular H-bond(s) is a very time consuming procedure. On the other hand, application of some average reference geometric parameters, stretching, bending force constants, and acceptance of average torsion potentials for these rather special molecules may lead to unreliably distorted structures in a flexible solute model. Thus use of the rigid optimized geometry could be favored from energy point of view and the computer time must be shorter if the energies of the solution configurations are to be averaged for a model with all rigid elements as compared with a slower convergence in case of a flexible solute. This option is not available for MD simulations, where individual molecular mechanics parameterization is desirable for high quality simulations. In some applications of the software, the lengths of the σ-bonds with hydrogen can be kept at fixed values although the problem of the critical torsion flexibility still remains.\nA further problem related to the use of the flexible solute model is that the MEP-fitted-charges characterize only the given optimized geometry. If the solute leaves this structure even temporarily, the charges are not relevant for the new geometry. Common in most MD and MC programs used in the past twenty years, however, is that the atomic point charges do not change through the simulation. In MD simulations with flexible solute geometry, the atomic charges are kept constant even if the structure changes among quite different conformational states.\nThis problem may be overcome by the use of the fluctuating charge (FC) model [100,101]. This approach introduces a polarizable force field, where the atomic point charges are allowed to fluctuate in response to the environment. Accordingly, this approach can account for the conformation dependence of the charges through the calculation of the solute-solvent interaction energies relevant to the actual solute geometry. The computation time increases by only about 10%, thus the method is applicable for simulations of large systems [102,103]. Nonetheless, MD/FEP calculation using the FC model has not been found even for small molecules in the surveyed literature.\nThe present review concentrates on results obtained with the described methods, although more recent simulation programs allow consideration of polarization charges on the solute and induced dipoles on the solvent [104]. The QM/MM (quantum mechanics/molecular mechanics) method [105,106] corresponds to the state-of-the-art level, but no article has been found where the QM/MM procedure was used for resolving the problem addressed in the title of this paper. The Car-Parinello molecular dynamics procedure (CPMD) [107] is another high-level theoretical method that could be applied as a state-of the-art procedure. The method applies pseudopotentials and the plane-wave basis set with periodic boundary conditions. The primary advantage of CPMD in comparison with ab initio molecular dynamics methods is that by introducing the extended variable Lagrangian formalism, CPMD can avoid the time demanding self-consistent matrix diagonalization at every step in the trajectory. Despite the attractive features of the method, the treatment of the electronic structure allows the application of CPMD only for systems that are remarkably smaller than those which can be easily considered in classical simulations. Although the possible disruption of the intramolecular H-bond in solution has been investigated only for small solutes below, the need to consider several hundred solvent molecules must have prevented applying CPMD because no such study has been found in the literature search.\n\n2.4. Dimeric Solutes\nThe issue to be considered in this section is similar to those that have been discussed above. Solutes with two polar sites for the monomer may form one or even two hydrogen bond(s) within a dimer, which is/are intramolecular from the perspective of this species. Then the created intramolecular H-bond(s) compete(s) with the intermolecular H-bonds between the monomeric solutes in the dissociated form and the solvent molecules. Specifically, monomers with HX–C=O (X = N, O) and N–C–OH substructures would belong to this category. Formation of an intramolecular H-bond in a four-member ring with two polar functionals would lead to a strained structure. Although covalently bound four-member rings exist, a moderately strong H-bond could not maintain this relationship. B97D/aug-cc-pvtz geometry optimization [79] found geometry parameters for the gas-phase acetic acid carboxylic group very close to experimental values [28]. Accordingly, the H…O= distance is 230 pm and the O–H…O= angle is 75.5°. This bond angle is quite far from the favorable O–H…O angle of about 170° generally found to be favorable intermolecular H-bond. Thus the OCOH moiety should not be considered to be a ring, and the better way to stabilize the HX–C=O or N–C–OH substructure is the formation of a dimer.\nThe gas-phase structure of formic acid was determined from the microwave spectrum by Lerner et al. [108]. The main geometric parameters were obtained by their fit to the rotational constants. However, formic acid assumes mostly a dimeric structure (FAD) in the gas phase. In a theoretical study, Turi [109] identified seven stable dimeric structures on the potential energy hypersurface. The doubly-hydrogen-bonded isomer of C2h symmetry was found to be the most stable arrangement. Experiments confirmed this theoretical prediction. A comprehensive discussion of the formic acid dimer related issues, the paradigm of symmetric hydrogen bonding, and a collection of former experimental papers were provided by Zielke and Suhm [110]. Rotationally resolved spectra were recorded under supersonic jet conditions for the FAD by Matylitsky et al. [111]. With the assumption of unperturbed monomers, a center-of-mass distance of R = 299.0 ± 0.1 pm for the monomers within the dimer was calculated from the spectroscopic results. A recent update of the experimental results on FAD augmented with theoretical calculations up to the MP2/aug-cc-pvtz level was provided by Balabin [112].\nGeometric parameters for monomeric and dimeric acetic and propionic acids in the gas phase were experimentally determined by Derissen [28,113]. Structures of the short-chain carboxylic acids to be discussed below are shown in Figure 5.\nFigure 5 Structures of the syn (11) and anti (12) acetic acid, the s-cis propenic acid (13) and the s-trans pyruvic acid (14). The carboxylic group is syn for the latter two. Simple aliphatic carboxylic acids assume predominantly doubly H-bonded, symmetrical dimeric forms (52%–87%) in the gas phase. The intermolecular H-bonds for acetic acid–acetic acid methyl ester complexes were studied by Emmeluth and Suhm [114] by FT-IR (Fourier transform infrared) spectroscopy for the mixed clusters in supersonic jet expansions. The methyl ester has two hydrogen-bond acceptor sites without having a strong donor site. The structural issues for the closest analogues of the acetic acid dimer and the acid-ester complexes were discussed on the basis of the recorded spectra and theoretical calculations at the MP2/6-311++G** and B3LYP/6-31G+G* levels. An interesting question is then: Will the dimeric forms be maintained in aqueous solution, or will the stabilizing “intramolecular” hydrogen bonds become disrupted so that the monomers can form intermolecular H-bonds with the water molecules.\nAnother important problem is the protonation state of the elements in the complexes of short-chain acids and bases. Such hydrogen-bonded complexes are stably formed from a neutral acid and a neutral amine in the gas phase, although acid—Guanidine complexes in the form of a hydrogen-bonded ion-pair are also stable in the gas phase [115]. In aqueous solution, the acid-amine ion-pair complexes are more stable, whereas the hydrogen-bonded complexes of the neutral elements are more stable in a low-polarity environment [116]. Such studies are important for the exploration of the ligand-receptor interactions in the binding cavity of the muscarinic acetylcholine receptor. At physiological pH of 7.4, a ligand with an amine function must be mainly protonated in the aqueous environment characteristic for the extracellular side of a transmembrane protein. When the ligand penetrates into the protein, most of the surrounding water molecules must be left behind. In the binding cavity of the receptor, at about 1100 pm from the surface, the ligand interacts with an aspartic acid side chain. According to a general consensus, the side chain is originally deprotonated. Thus a theoretical study of the protonation states for the partners under the modeling condition is justified [116].\nThe studies for dimer/complex formations in solution start with the geometry optimization of the associated species in the selected solvent. Using the continuum solvent approach, the poor handling of the solute-solvent H-bonds is less dramatic for the dimers of aliphatic acids. The main H-bond forming sites are involved in bonds to the partner, thus a less polar molecular surface of the dimer is seen by the solvent within the cavity. An acetic acid—Alkyl amine (mainly trimethyl amine) complex exhibits even larger non-polar molecular surface. The polar sites interact with each other in the depth of the complex, irrespective of whether the H-bond formed between the constituents is neutral or ionic.\nThe possible dimerization/complex formation in solution can be followed by calculating the potential of mean force (pmf) curve. This curve reflects the change of the solution free energy as a function of the solutes’ separation, taking the free energy of the solution with largely separated solutes as the reference state. If the pmf is calculated by the FEP procedure, the “R” separation of two reference atoms of the solutes should change only slightly by, e.g., 20 pm, and the free energy at separations R ± 10 pm could be calculated as a perturbation. Local minima of a pmf indicate stably associated forms, local maxima indicate barrier tops for association. The pmf may have more than one minimum site, also could decrease monotonically from the large-solute-separation reference state until reaching the minimum. The latter shape of the pmf indicates solute association without activation free energy. At small solute separations generally below 350 pm, the curves steeply ascend due to the quickly increasing van der Waals repulsions.\nFor large, more than 500–600 pm separation of the reference atoms, acceptance of the monomeric geometry and the related net atomic charges is reasonable. When the formation of the intermolecular H-bond(s) may start between the solutes, the geometry and the charge parameters should be gradually transformed into the values that were obtained for the optimized dimer. The related charges were fitted to the corresponding in-solution MEP [79,117].\n"}
NEUROSES
{"project":"NEUROSES","denotations":[{"id":"T413","span":{"begin":2641,"end":2647},"obj":"PATO_0000608"},{"id":"T414","span":{"begin":2821,"end":2828},"obj":"PATO_0001019"},{"id":"T415","span":{"begin":2829,"end":2839},"obj":"PATO_0001510"},{"id":"T416","span":{"begin":2882,"end":2888},"obj":"PATO_0001484"},{"id":"T417","span":{"begin":3085,"end":3095},"obj":"PATO_0000990"},{"id":"T418","span":{"begin":3195,"end":3201},"obj":"PATO_0001484"},{"id":"T419","span":{"begin":3289,"end":3298},"obj":"PATO_0000516"},{"id":"T420","span":{"begin":4158,"end":4167},"obj":"PATO_0000516"},{"id":"T421","span":{"begin":3371,"end":3379},"obj":"CHEBI_10545"},{"id":"T422","span":{"begin":3460,"end":3471},"obj":"CHEBI_33232"},{"id":"T423","span":{"begin":3487,"end":3494},"obj":"CHEBI_33731"},{"id":"T424","span":{"begin":3530,"end":3537},"obj":"CHEBI_33731"},{"id":"T425","span":{"begin":3663,"end":3669},"obj":"PATO_0001021"},{"id":"T426","span":{"begin":3732,"end":3738},"obj":"PATO_0001021"},{"id":"T427","span":{"begin":3781,"end":3787},"obj":"PATO_0001021"},{"id":"T428","span":{"begin":3800,"end":3806},"obj":"PATO_0001021"},{"id":"T429","span":{"begin":3888,"end":3894},"obj":"PATO_0001021"},{"id":"T430","span":{"begin":4403,"end":4409},"obj":"PATO_0001021"},{"id":"T431","span":{"begin":4441,"end":4447},"obj":"PATO_0001021"},{"id":"T432","span":{"begin":4618,"end":4624},"obj":"PATO_0001021"},{"id":"T433","span":{"begin":3852,"end":3861},"obj":"PATO_0000044"},{"id":"T434","span":{"begin":4009,"end":4012},"obj":"PATO_0001737"},{"id":"T435","span":{"begin":4013,"end":4022},"obj":"CHEBI_25367"},{"id":"T436","span":{"begin":4330,"end":4339},"obj":"CHEBI_25367"},{"id":"T437","span":{"begin":4028,"end":4032},"obj":"PATO_0002316"},{"id":"T438","span":{"begin":4067,"end":4078},"obj":"PATO_0000146"},{"id":"T439","span":{"begin":4085,"end":4093},"obj":"PATO_0001025"},{"id":"T440","span":{"begin":4214,"end":4218},"obj":"PATO_0000469"},{"id":"T441","span":{"begin":4265,"end":4270},"obj":"PATO_0000586"},{"id":"T442","span":{"begin":4324,"end":4329},"obj":"PATO_0000587"},{"id":"T443","span":{"begin":4429,"end":4435},"obj":"PATO_0000461"},{"id":"T444","span":{"begin":4672,"end":4677},"obj":"PATO_0000002"},{"id":"T445","span":{"begin":4787,"end":4792},"obj":"CHEBI_48431"},{"id":"T446","span":{"begin":4787,"end":4792},"obj":"CHEBI_16397"},{"id":"T447","span":{"begin":4793,"end":4801},"obj":"CHEBI_25367"},{"id":"T448","span":{"begin":5467,"end":5475},"obj":"CHEBI_25367"},{"id":"T449","span":{"begin":5091,"end":5096},"obj":"PATO_0000586"},{"id":"T450","span":{"begin":5270,"end":5277},"obj":"PATO_0000462"},{"id":"T451","span":{"begin":6008,"end":6015},"obj":"PATO_0000462"},{"id":"T452","span":{"begin":5327,"end":5332},"obj":"PATO_0000002"},{"id":"T453","span":{"begin":5461,"end":5466},"obj":"PATO_0000587"},{"id":"T454","span":{"begin":5512,"end":5517},"obj":"PATO_0000582"},{"id":"T455","span":{"begin":5518,"end":5523},"obj":"CHEBI_33250"},{"id":"T456","span":{"begin":6210,"end":6215},"obj":"PATO_0000587"},{"id":"T457","span":{"begin":6623,"end":6628},"obj":"PATO_0000587"},{"id":"T458","span":{"begin":6244,"end":6264},"obj":"CHEBI_18135"},{"id":"T459","span":{"begin":6334,"end":6343},"obj":"PATO_0001507"},{"id":"T460","span":{"begin":6436,"end":6445},"obj":"PATO_0000516"},{"id":"T461","span":{"begin":6453,"end":6462},"obj":"PATO_0000141"},{"id":"T462","span":{"begin":6606,"end":6610},"obj":"PATO_0000469"},{"id":"T463","span":{"begin":7320,"end":7324},"obj":"PATO_0000469"},{"id":"T464","span":{"begin":6787,"end":6794},"obj":"PATO_0001019"},{"id":"T465","span":{"begin":6913,"end":6920},"obj":"PATO_0001019"},{"id":"T466","span":{"begin":7252,"end":7259},"obj":"PATO_0001019"},{"id":"T467","span":{"begin":6859,"end":6862},"obj":"CHEBI_52027"},{"id":"T468","span":{"begin":7031,"end":7038},"obj":"PATO_0001566"},{"id":"T469","span":{"begin":7056,"end":7064},"obj":"CHEBI_49637"},{"id":"T470","span":{"begin":7056,"end":7064},"obj":"CHEBI_18276"},{"id":"T471","span":{"begin":7065,"end":7070},"obj":"CHEBI_33250"},{"id":"T472","span":{"begin":7243,"end":7251},"obj":"CHEBI_10545"},{"id":"T473","span":{"begin":7395,"end":7404},"obj":"CHEBI_25367"},{"id":"T474","span":{"begin":7563,"end":7567},"obj":"PATO_0000117"},{"id":"T475","span":{"begin":7673,"end":7682},"obj":"PATO_0000044"},{"id":"T476","span":{"begin":7792,"end":7799},"obj":"CHEBI_43176"},{"id":"T477","span":{"begin":7844,"end":7847},"obj":"PATO_0000322"},{"id":"T478","span":{"begin":8071,"end":8074},"obj":"PATO_0000322"},{"id":"T479","span":{"begin":7929,"end":7938},"obj":"PATO_0000044"},{"id":"T480","span":{"begin":8796,"end":8805},"obj":"PATO_0000044"},{"id":"T481","span":{"begin":8946,"end":8955},"obj":"PATO_0000044"},{"id":"T482","span":{"begin":7985,"end":7989},"obj":"PATO_0002316"},{"id":"T483","span":{"begin":8824,"end":8828},"obj":"PATO_0002316"},{"id":"T484","span":{"begin":8922,"end":8926},"obj":"PATO_0002316"},{"id":"T485","span":{"begin":8002,"end":8010},"obj":"CHEBI_44520"},{"id":"T486","span":{"begin":8002,"end":8010},"obj":"CHEBI_17790"},{"id":"T487","span":{"begin":8048,"end":8056},"obj":"CHEBI_49637"},{"id":"T488","span":{"begin":8521,"end":8529},"obj":"CHEBI_49637"},{"id":"T489","span":{"begin":8048,"end":8056},"obj":"CHEBI_18276"},{"id":"T490","span":{"begin":8521,"end":8529},"obj":"CHEBI_18276"},{"id":"T491","span":{"begin":8202,"end":8216},"obj":"CHEBI_30742"},{"id":"T492","span":{"begin":8244,"end":8248},"obj":"CHEBI_30742"},{"id":"T493","span":{"begin":8444,"end":8448},"obj":"CHEBI_30742"},{"id":"T494","span":{"begin":8264,"end":8267},"obj":"CHEBI_55390"},{"id":"T495","span":{"begin":8264,"end":8267},"obj":"CHEBI_30085"},{"id":"T496","span":{"begin":8268,"end":8276},"obj":"PATO_0000965"},{"id":"T497","span":{"begin":8480,"end":8488},"obj":"PATO_0000965"},{"id":"T498","span":{"begin":8292,"end":8298},"obj":"CHEBI_6446"},{"id":"T499","span":{"begin":8709,"end":8714},"obj":"PATO_0000002"},{"id":"T500","span":{"begin":9176,"end":9180},"obj":"PATO_0000469"},{"id":"T501","span":{"begin":9307,"end":9316},"obj":"PATO_0000044"},{"id":"T502","span":{"begin":9486,"end":9495},"obj":"PATO_0000044"},{"id":"T503","span":{"begin":9591,"end":9600},"obj":"PATO_0000044"},{"id":"T504","span":{"begin":9863,"end":9872},"obj":"PATO_0000044"},{"id":"T505","span":{"begin":9329,"end":9334},"obj":"CHEBI_24433"},{"id":"T506","span":{"begin":9535,"end":9543},"obj":"CHEBI_23019"},{"id":"T507","span":{"begin":9544,"end":9550},"obj":"CHEBI_15379"},{"id":"T508","span":{"begin":9544,"end":9550},"obj":"CHEBI_25805"},{"id":"T509","span":{"begin":9572,"end":9576},"obj":"PATO_0002316"},{"id":"T510","span":{"begin":9786,"end":9789},"obj":"PATO_0000322"},{"id":"T511","span":{"begin":10295,"end":10298},"obj":"PATO_0000322"},{"id":"T512","span":{"begin":10483,"end":10486},"obj":"PATO_0000322"},{"id":"T513","span":{"begin":10051,"end":10054},"obj":"CHEBI_43739"},{"id":"T514","span":{"begin":10071,"end":10080},"obj":"PATO_0000141"},{"id":"T515","span":{"begin":10204,"end":10214},"obj":"PATO_0000990"},{"id":"T516","span":{"begin":10274,"end":10277},"obj":"CHEBI_73627"},{"id":"T517","span":{"begin":10673,"end":10676},"obj":"CHEBI_73627"},{"id":"T518","span":{"begin":10274,"end":10277},"obj":"CHEBI_16238"},{"id":"T519","span":{"begin":10673,"end":10676},"obj":"CHEBI_16238"},{"id":"T520","span":{"begin":10274,"end":10277},"obj":"CHEBI_57692"},{"id":"T521","span":{"begin":10673,"end":10676},"obj":"CHEBI_57692"},{"id":"T522","span":{"begin":10342,"end":10347},"obj":"PATO_0000002"},{"id":"T523","span":{"begin":10473,"end":10482},"obj":"PATO_0000470"},{"id":"T524","span":{"begin":10542,"end":10548},"obj":"PATO_0001716"},{"id":"T525","span":{"begin":10725,"end":10728},"obj":"PATO_0000322"},{"id":"T526","span":{"begin":10893,"end":10896},"obj":"PATO_0000322"},{"id":"T527","span":{"begin":11353,"end":11356},"obj":"PATO_0000322"},{"id":"T528","span":{"begin":10796,"end":10801},"obj":"PATO_0000002"},{"id":"T529","span":{"begin":10954,"end":10957},"obj":"CHEBI_73627"},{"id":"T530","span":{"begin":10954,"end":10957},"obj":"CHEBI_16238"},{"id":"T531","span":{"begin":10954,"end":10957},"obj":"CHEBI_57692"},{"id":"T532","span":{"begin":11090,"end":11095},"obj":"CHEBI_24433"},{"id":"T533","span":{"begin":11924,"end":11929},"obj":"CHEBI_24433"},{"id":"T534","span":{"begin":11118,"end":11123},"obj":"CHEBI_33250"},{"id":"T535","span":{"begin":11831,"end":11835},"obj":"CHEBI_33250"},{"id":"T536","span":{"begin":11887,"end":11891},"obj":"CHEBI_33250"},{"id":"T537","span":{"begin":11131,"end":11137},"obj":"CHEBI_30396"},{"id":"T538","span":{"begin":11190,"end":11199},"obj":"PATO_0000044"},{"id":"T539","span":{"begin":11347,"end":11352},"obj":"PATO_0000586"},{"id":"T540","span":{"begin":11580,"end":11586},"obj":"PATO_0001503"},{"id":"T541","span":{"begin":11677,"end":11685},"obj":"CHEBI_49637"},{"id":"T542","span":{"begin":11836,"end":11844},"obj":"CHEBI_49637"},{"id":"T543","span":{"begin":11677,"end":11685},"obj":"CHEBI_18276"},{"id":"T544","span":{"begin":11836,"end":11844},"obj":"CHEBI_18276"},{"id":"T545","span":{"begin":11723,"end":11731},"obj":"PATO_0000040"},{"id":"T546","span":{"begin":11746,"end":11751},"obj":"PATO_0000133"},{"id":"T547","span":{"begin":11746,"end":11751},"obj":"PATO_0002326"},{"id":"T548","span":{"begin":11792,"end":11801},"obj":"PATO_0000141"},{"id":"T549","span":{"begin":11809,"end":11817},"obj":"CHEBI_25367"},{"id":"T550","span":{"begin":11931,"end":11937},"obj":"CHEBI_32988"},{"id":"T551","span":{"begin":12418,"end":12425},"obj":"PATO_0002258"},{"id":"T552","span":{"begin":12537,"end":12548},"obj":"PATO_0000146"},{"id":"T553","span":{"begin":12822,"end":12829},"obj":"CHEBI_43176"},{"id":"T554","span":{"begin":12864,"end":12870},"obj":"PATO_0001021"},{"id":"T555","span":{"begin":12978,"end":12984},"obj":"PATO_0001021"},{"id":"T556","span":{"begin":13189,"end":13195},"obj":"PATO_0001021"},{"id":"T557","span":{"begin":12973,"end":12977},"obj":"PATO_0002316"},{"id":"T558","span":{"begin":13184,"end":13188},"obj":"PATO_0002316"},{"id":"T559","span":{"begin":12994,"end":13003},"obj":"PATO_0000141"},{"id":"T560","span":{"begin":13375,"end":13384},"obj":"PATO_0000141"},{"id":"T561","span":{"begin":13062,"end":13065},"obj":"PATO_0001737"},{"id":"T562","span":{"begin":13365,"end":13368},"obj":"PATO_0001737"},{"id":"T563","span":{"begin":13066,"end":13071},"obj":"PATO_0000083"},{"id":"T564","span":{"begin":13369,"end":13374},"obj":"PATO_0000083"},{"id":"T565","span":{"begin":13099,"end":13108},"obj":"PATO_0001507"},{"id":"T566","span":{"begin":13165,"end":13174},"obj":"PATO_0000470"},{"id":"T567","span":{"begin":13240,"end":13248},"obj":"CHEBI_18276"},{"id":"T568","span":{"begin":13240,"end":13248},"obj":"CHEBI_49637"},{"id":"T569","span":{"begin":13390,"end":13395},"obj":"CHEBI_46882"},{"id":"T570","span":{"begin":13410,"end":13414},"obj":"CHEBI_16865"},{"id":"T571","span":{"begin":13811,"end":13819},"obj":"CHEBI_75958"},{"id":"T572","span":{"begin":13863,"end":13870},"obj":"CHEBI_46787"},{"id":"T573","span":{"begin":14013,"end":14020},"obj":"CHEBI_46787"},{"id":"T574","span":{"begin":14145,"end":14152},"obj":"CHEBI_46787"},{"id":"T575","span":{"begin":14463,"end":14470},"obj":"CHEBI_46787"},{"id":"T576","span":{"begin":14495,"end":14502},"obj":"CHEBI_46787"},{"id":"T577","span":{"begin":14730,"end":14737},"obj":"CHEBI_46787"},{"id":"T578","span":{"begin":13943,"end":13948},"obj":"CHEBI_24433"},{"id":"T579","span":{"begin":14181,"end":14187},"obj":"PATO_0001021"},{"id":"T580","span":{"begin":14193,"end":14199},"obj":"PATO_0001021"},{"id":"T581","span":{"begin":14607,"end":14613},"obj":"PATO_0001021"},{"id":"T582","span":{"begin":14665,"end":14671},"obj":"PATO_0001021"},{"id":"T583","span":{"begin":14677,"end":14683},"obj":"PATO_0001021"},{"id":"T584","span":{"begin":14764,"end":14770},"obj":"PATO_0001021"},{"id":"T585","span":{"begin":14881,"end":14887},"obj":"PATO_0001021"},{"id":"T586","span":{"begin":14956,"end":14962},"obj":"PATO_0001021"},{"id":"T587","span":{"begin":14968,"end":14974},"obj":"PATO_0001021"},{"id":"T588","span":{"begin":14188,"end":14192},"obj":"PATO_0002316"},{"id":"T589","span":{"begin":14963,"end":14967},"obj":"PATO_0002316"},{"id":"T590","span":{"begin":14203,"end":14212},"obj":"PATO_0001986"},{"id":"T591","span":{"begin":14213,"end":14222},"obj":"CHEBI_25367"},{"id":"T592","span":{"begin":14655,"end":14658},"obj":"PATO_0001737"},{"id":"T593","span":{"begin":15054,"end":15057},"obj":"PATO_0001737"},{"id":"T594","span":{"begin":14659,"end":14664},"obj":"PATO_0000083"},{"id":"T595","span":{"begin":15058,"end":15063},"obj":"PATO_0000083"},{"id":"T596","span":{"begin":14696,"end":14704},"obj":"PATO_0000757"},{"id":"T597","span":{"begin":14829,"end":14839},"obj":"PATO_0000990"},{"id":"T598","span":{"begin":14852,"end":14859},"obj":"PATO_0000006"},{"id":"T599","span":{"begin":15202,"end":15208},"obj":"PATO_0001021"},{"id":"T600","span":{"begin":15570,"end":15576},"obj":"PATO_0001021"},{"id":"T601","span":{"begin":16649,"end":16655},"obj":"PATO_0001021"},{"id":"T602","span":{"begin":15322,"end":15331},"obj":"PATO_0000044"},{"id":"T603","span":{"begin":15418,"end":15426},"obj":"CHEBI_25367"},{"id":"T604","span":{"begin":15638,"end":15647},"obj":"CHEBI_25367"},{"id":"T605","span":{"begin":15540,"end":15543},"obj":"PATO_0001737"},{"id":"T606","span":{"begin":15929,"end":15932},"obj":"PATO_0001737"},{"id":"T607","span":{"begin":15669,"end":15676},"obj":"CHEBI_46787"},{"id":"T608","span":{"begin":16117,"end":16124},"obj":"CHEBI_46787"},{"id":"T609","span":{"begin":16221,"end":16228},"obj":"CHEBI_46787"},{"id":"T610","span":{"begin":16260,"end":16267},"obj":"CHEBI_46787"},{"id":"T611","span":{"begin":16330,"end":16337},"obj":"CHEBI_46787"},{"id":"T612","span":{"begin":16627,"end":16634},"obj":"CHEBI_46787"},{"id":"T613","span":{"begin":15765,"end":15769},"obj":"CHEBI_27385"},{"id":"T614","span":{"begin":15771,"end":15776},"obj":"CHEBI_35255"},{"id":"T615","span":{"begin":15817,"end":15822},"obj":"PATO_0000587"},{"id":"T616","span":{"begin":15872,"end":15877},"obj":"PATO_0000586"},{"id":"T617","span":{"begin":15933,"end":15938},"obj":"PATO_0000083"},{"id":"T618","span":{"begin":15977,"end":15982},"obj":"CHEBI_15377"},{"id":"T619","span":{"begin":15977,"end":15982},"obj":"CHEBI_46629"},{"id":"T620","span":{"begin":15986,"end":15994},"obj":"CHEBI_44520"},{"id":"T621","span":{"begin":15986,"end":15994},"obj":"CHEBI_17790"},{"id":"T622","span":{"begin":16012,"end":16018},"obj":"CHEBI_24636"},{"id":"T623","span":{"begin":16019,"end":16024},"obj":"CHEBI_17499"},{"id":"T624","span":{"begin":16019,"end":16024},"obj":"CHEBI_17891"},{"id":"T625","span":{"begin":16029,"end":16037},"obj":"CHEBI_15339"},{"id":"T626","span":{"begin":16154,"end":16158},"obj":"PATO_0002316"},{"id":"T627","span":{"begin":16445,"end":16453},"obj":"PATO_0001933"},{"id":"T628","span":{"begin":16465,"end":16473},"obj":"CHEBI_18276"},{"id":"T629","span":{"begin":16465,"end":16473},"obj":"CHEBI_49637"},{"id":"T630","span":{"begin":16719,"end":16723},"obj":"PATO_0002316"},{"id":"T631","span":{"begin":18151,"end":18155},"obj":"PATO_0002316"},{"id":"T632","span":{"begin":16724,"end":16730},"obj":"PATO_0001021"},{"id":"T633","span":{"begin":16862,"end":16869},"obj":"CHEBI_46787"},{"id":"T634","span":{"begin":17061,"end":17068},"obj":"CHEBI_46787"},{"id":"T635","span":{"begin":17108,"end":17115},"obj":"CHEBI_46787"},{"id":"T636","span":{"begin":17215,"end":17222},"obj":"CHEBI_46787"},{"id":"T637","span":{"begin":17278,"end":17285},"obj":"CHEBI_46787"},{"id":"T638","span":{"begin":17547,"end":17554},"obj":"CHEBI_46787"},{"id":"T639","span":{"begin":17583,"end":17590},"obj":"CHEBI_46787"},{"id":"T640","span":{"begin":17709,"end":17716},"obj":"CHEBI_46787"},{"id":"T641","span":{"begin":17069,"end":17078},"obj":"CHEBI_25367"},{"id":"T642","span":{"begin":17116,"end":17125},"obj":"CHEBI_25367"},{"id":"T643","span":{"begin":17286,"end":17295},"obj":"CHEBI_25367"},{"id":"T644","span":{"begin":17476,"end":17484},"obj":"CHEBI_25367"},{"id":"T645","span":{"begin":17555,"end":17564},"obj":"CHEBI_25367"},{"id":"T646","span":{"begin":17591,"end":17600},"obj":"CHEBI_25367"},{"id":"T647","span":{"begin":17602,"end":17607},"obj":"CHEBI_15377"},{"id":"T648","span":{"begin":17602,"end":17607},"obj":"CHEBI_46629"},{"id":"T649","span":{"begin":17609,"end":17617},"obj":"CHEBI_44520"},{"id":"T650","span":{"begin":17609,"end":17617},"obj":"CHEBI_17790"},{"id":"T651","span":{"begin":17700,"end":17708},"obj":"CHEBI_15339"},{"id":"T652","span":{"begin":17771,"end":17774},"obj":"PATO_0001737"},{"id":"T653","span":{"begin":17775,"end":17780},"obj":"PATO_0000083"},{"id":"T654","span":{"begin":17962,"end":17966},"obj":"CHEBI_18282"},{"id":"T655","span":{"begin":17962,"end":17966},"obj":"CHEBI_22695"},{"id":"T656","span":{"begin":18002,"end":18008},"obj":"PATO_0001484"},{"id":"T657","span":{"begin":18042,"end":18053},"obj":"CHEBI_33232"},{"id":"T658","span":{"begin":18197,"end":18202},"obj":"CHEBI_33250"},{"id":"T659","span":{"begin":18328,"end":18333},"obj":"CHEBI_46629"},{"id":"T660","span":{"begin":19505,"end":19510},"obj":"CHEBI_46629"},{"id":"T661","span":{"begin":18328,"end":18333},"obj":"CHEBI_15377"},{"id":"T662","span":{"begin":19505,"end":19510},"obj":"CHEBI_15377"},{"id":"T663","span":{"begin":18334,"end":18343},"obj":"CHEBI_25367"},{"id":"T664","span":{"begin":18483,"end":18492},"obj":"CHEBI_25367"},{"id":"T665","span":{"begin":19225,"end":19234},"obj":"CHEBI_25367"},{"id":"T666","span":{"begin":19339,"end":19348},"obj":"CHEBI_25367"},{"id":"T667","span":{"begin":19466,"end":19475},"obj":"CHEBI_25367"},{"id":"T668","span":{"begin":19511,"end":19520},"obj":"CHEBI_25367"},{"id":"T669","span":{"begin":19628,"end":19637},"obj":"CHEBI_25367"},{"id":"T670","span":{"begin":18448,"end":18455},"obj":"PATO_0001997"},{"id":"T671","span":{"begin":18475,"end":18482},"obj":"CHEBI_46787"},{"id":"T672","span":{"begin":19217,"end":19224},"obj":"CHEBI_46787"},{"id":"T673","span":{"begin":19331,"end":19338},"obj":"CHEBI_46787"},{"id":"T674","span":{"begin":18740,"end":18744},"obj":"PATO_0000304"},{"id":"T675","span":{"begin":18748,"end":18756},"obj":"CHEBI_75958"},{"id":"T676","span":{"begin":19294,"end":19302},"obj":"CHEBI_75958"},{"id":"T677","span":{"begin":18790,"end":18796},"obj":"PATO_0001021"},{"id":"T678","span":{"begin":18876,"end":18881},"obj":"PATO_0000587"},{"id":"T679","span":{"begin":19019,"end":19030},"obj":"CHEBI_33232"},{"id":"T680","span":{"begin":19043,"end":19047},"obj":"PATO_0001309"},{"id":"T681","span":{"begin":19043,"end":19047},"obj":"PATO_0000165"},{"id":"T682","span":{"begin":19171,"end":19174},"obj":"PATO_0000471"},{"id":"T683","span":{"begin":19432,"end":19437},"obj":"CHEBI_46882"},{"id":"T684","span":{"begin":19532,"end":19536},"obj":"CHEBI_24757"},{"id":"T685","span":{"begin":19719,"end":19724},"obj":"PATO_0000587"},{"id":"T686","span":{"begin":20343,"end":20348},"obj":"PATO_0000587"},{"id":"T687","span":{"begin":19827,"end":19830},"obj":"CHEBI_33813"},{"id":"T688","span":{"begin":19838,"end":19841},"obj":"CHEBI_33813"},{"id":"T689","span":{"begin":19827,"end":19830},"obj":"CHEBI_15377"},{"id":"T690","span":{"begin":19838,"end":19841},"obj":"CHEBI_15377"},{"id":"T691","span":{"begin":19886,"end":19891},"obj":"CHEBI_15377"},{"id":"T692","span":{"begin":20020,"end":20025},"obj":"CHEBI_15377"},{"id":"T693","span":{"begin":20429,"end":20434},"obj":"CHEBI_15377"},{"id":"T694","span":{"begin":20496,"end":20501},"obj":"CHEBI_15377"},{"id":"T695","span":{"begin":20502,"end":20507},"obj":"CHEBI_15377"},{"id":"T696","span":{"begin":20550,"end":20555},"obj":"CHEBI_15377"},{"id":"T697","span":{"begin":20600,"end":20605},"obj":"CHEBI_15377"},{"id":"T698","span":{"begin":20624,"end":20629},"obj":"CHEBI_15377"},{"id":"T699","span":{"begin":20698,"end":20703},"obj":"CHEBI_15377"},{"id":"T700","span":{"begin":20762,"end":20767},"obj":"CHEBI_15377"},{"id":"T701","span":{"begin":20789,"end":20794},"obj":"CHEBI_15377"},{"id":"T702","span":{"begin":19827,"end":19830},"obj":"CHEBI_29375"},{"id":"T703","span":{"begin":19838,"end":19841},"obj":"CHEBI_29375"},{"id":"T704","span":{"begin":19842,"end":19849},"obj":"CHEBI_35505"},{"id":"T705","span":{"begin":19886,"end":19891},"obj":"CHEBI_46629"},{"id":"T706","span":{"begin":20020,"end":20025},"obj":"CHEBI_46629"},{"id":"T707","span":{"begin":20429,"end":20434},"obj":"CHEBI_46629"},{"id":"T708","span":{"begin":20496,"end":20501},"obj":"CHEBI_46629"},{"id":"T709","span":{"begin":20502,"end":20507},"obj":"CHEBI_46629"},{"id":"T710","span":{"begin":20550,"end":20555},"obj":"CHEBI_46629"},{"id":"T711","span":{"begin":20600,"end":20605},"obj":"CHEBI_46629"},{"id":"T712","span":{"begin":20624,"end":20629},"obj":"CHEBI_46629"},{"id":"T713","span":{"begin":20698,"end":20703},"obj":"CHEBI_46629"},{"id":"T714","span":{"begin":20762,"end":20767},"obj":"CHEBI_46629"},{"id":"T715","span":{"begin":20789,"end":20794},"obj":"CHEBI_46629"},{"id":"T716","span":{"begin":19935,"end":19944},"obj":"PATO_0000141"},{"id":"T717","span":{"begin":19959,"end":19968},"obj":"PATO_0001499"},{"id":"T718","span":{"begin":19969,"end":19977},"obj":"PATO_0000965"},{"id":"T719","span":{"begin":20061,"end":20068},"obj":"CHEBI_46787"},{"id":"T720","span":{"begin":20368,"end":20375},"obj":"CHEBI_46787"},{"id":"T721","span":{"begin":20853,"end":20860},"obj":"CHEBI_46787"},{"id":"T722","span":{"begin":20069,"end":20078},"obj":"CHEBI_25367"},{"id":"T723","span":{"begin":20376,"end":20385},"obj":"CHEBI_25367"},{"id":"T724","span":{"begin":20435,"end":20444},"obj":"CHEBI_25367"},{"id":"T725","span":{"begin":20795,"end":20804},"obj":"CHEBI_25367"},{"id":"T726","span":{"begin":20157,"end":20165},"obj":"CHEBI_75958"},{"id":"T727","span":{"begin":20289,"end":20297},"obj":"CHEBI_75958"},{"id":"T728","span":{"begin":20606,"end":20614},"obj":"CHEBI_49637"},{"id":"T729","span":{"begin":20711,"end":20719},"obj":"CHEBI_49637"},{"id":"T730","span":{"begin":20981,"end":20989},"obj":"CHEBI_49637"},{"id":"T731","span":{"begin":20606,"end":20614},"obj":"CHEBI_18276"},{"id":"T732","span":{"begin":20711,"end":20719},"obj":"CHEBI_18276"},{"id":"T733","span":{"begin":20981,"end":20989},"obj":"CHEBI_18276"},{"id":"T734","span":{"begin":20630,"end":20637},"obj":"CHEBI_33731"},{"id":"T735","span":{"begin":20945,"end":20950},"obj":"CHEBI_24433"},{"id":"T736","span":{"begin":21106,"end":21112},"obj":"CHEBI_15882"},{"id":"T737","span":{"begin":21249,"end":21256},"obj":"CHEBI_46787"},{"id":"T738","span":{"begin":21326,"end":21333},"obj":"CHEBI_46787"},{"id":"T739","span":{"begin":21334,"end":21341},"obj":"CHEBI_46787"},{"id":"T740","span":{"begin":21387,"end":21395},"obj":"CHEBI_46787"},{"id":"T741","span":{"begin":21453,"end":21460},"obj":"CHEBI_46787"},{"id":"T742","span":{"begin":21608,"end":21615},"obj":"CHEBI_46787"},{"id":"T743","span":{"begin":21786,"end":21793},"obj":"CHEBI_46787"},{"id":"T744","span":{"begin":22173,"end":22180},"obj":"CHEBI_46787"},{"id":"T745","span":{"begin":22371,"end":22378},"obj":"CHEBI_46787"},{"id":"T746","span":{"begin":22400,"end":22407},"obj":"CHEBI_46787"},{"id":"T747","span":{"begin":22468,"end":22475},"obj":"CHEBI_46787"},{"id":"T748","span":{"begin":21342,"end":21350},"obj":"CHEBI_18276"},{"id":"T749","span":{"begin":21342,"end":21350},"obj":"CHEBI_49637"},{"id":"T750","span":{"begin":21362,"end":21370},"obj":"CHEBI_17790"},{"id":"T751","span":{"begin":21362,"end":21370},"obj":"CHEBI_44520"},{"id":"T752","span":{"begin":21461,"end":21470},"obj":"CHEBI_25367"},{"id":"T753","span":{"begin":21616,"end":21625},"obj":"CHEBI_25367"},{"id":"T754","span":{"begin":21741,"end":21750},"obj":"CHEBI_25367"},{"id":"T755","span":{"begin":22181,"end":22190},"obj":"CHEBI_25367"},{"id":"T756","span":{"begin":22408,"end":22417},"obj":"CHEBI_25367"},{"id":"T757","span":{"begin":22442,"end":22451},"obj":"CHEBI_25367"},{"id":"T758","span":{"begin":21735,"end":21740},"obj":"CHEBI_15377"},{"id":"T759","span":{"begin":21735,"end":21740},"obj":"CHEBI_46629"},{"id":"T760","span":{"begin":21925,"end":21934},"obj":"PATO_0000141"},{"id":"T761","span":{"begin":22037,"end":22046},"obj":"PATO_0000141"},{"id":"T762","span":{"begin":21997,"end":22005},"obj":"CHEBI_75958"},{"id":"T763","span":{"begin":22096,"end":22104},"obj":"CHEBI_75958"},{"id":"T764","span":{"begin":22345,"end":22353},"obj":"CHEBI_75958"},{"id":"T765","span":{"begin":22148,"end":22153},"obj":"PATO_0000586"},{"id":"T766","span":{"begin":22365,"end":22370},"obj":"PATO_0000586"},{"id":"T767","span":{"begin":22477,"end":22482},"obj":"CHEBI_33250"},{"id":"T768","span":{"begin":22624,"end":22629},"obj":"CHEBI_33250"},{"id":"T769","span":{"begin":22610,"end":22616},"obj":"PATO_0001021"},{"id":"T770","span":{"begin":22643,"end":22652},"obj":"CHEBI_25367"},{"id":"T771","span":{"begin":22700,"end":22706},"obj":"PATO_0001021"},{"id":"T772","span":{"begin":23204,"end":23210},"obj":"PATO_0001021"},{"id":"T773","span":{"begin":23619,"end":23625},"obj":"PATO_0001021"},{"id":"T774","span":{"begin":23991,"end":23997},"obj":"PATO_0001021"},{"id":"T775","span":{"begin":22857,"end":22862},"obj":"PATO_0000586"},{"id":"T776","span":{"begin":22963,"end":22971},"obj":"CHEBI_33250"},{"id":"T777","span":{"begin":23105,"end":23113},"obj":"CHEBI_33250"},{"id":"T778","span":{"begin":22979,"end":22987},"obj":"CHEBI_75958"},{"id":"T779","span":{"begin":23579,"end":23587},"obj":"CHEBI_75958"},{"id":"T780","span":{"begin":23261,"end":23264},"obj":"PATO_0000308"},{"id":"T781","span":{"begin":23292,"end":23296},"obj":"PATO_0000165"},{"id":"T782","span":{"begin":23292,"end":23296},"obj":"PATO_0001309"},{"id":"T783","span":{"begin":23314,"end":23321},"obj":"PATO_0000006"},{"id":"T784","span":{"begin":23588,"end":23597},"obj":"PATO_0000141"},{"id":"T785","span":{"begin":23779,"end":23791},"obj":"PATO_0000060"},{"id":"T786","span":{"begin":23797,"end":23805},"obj":"PATO_0000438"},{"id":"T787","span":{"begin":23806,"end":23817},"obj":"PATO_0000146"},{"id":"T788","span":{"begin":23835,"end":23846},"obj":"PATO_0000146"},{"id":"T789","span":{"begin":23983,"end":23990},"obj":"PATO_0000461"},{"id":"T790","span":{"begin":24009,"end":24015},"obj":"PATO_0000918"},{"id":"T791","span":{"begin":24274,"end":24281},"obj":"PATO_0000006"},{"id":"T792","span":{"begin":24287,"end":24295},"obj":"CHEBI_75958"},{"id":"T793","span":{"begin":24380,"end":24385},"obj":"PATO_0001035"},{"id":"T794","span":{"begin":24554,"end":24559},"obj":"PATO_0001035"},{"id":"T795","span":{"begin":24426,"end":24432},"obj":"PATO_0001021"},{"id":"T796","span":{"begin":24600,"end":24606},"obj":"PATO_0001021"},{"id":"T797","span":{"begin":24797,"end":24803},"obj":"PATO_0001021"},{"id":"T798","span":{"begin":24888,"end":24894},"obj":"PATO_0001021"},{"id":"T799","span":{"begin":24519,"end":24523},"obj":"CHEBI_33250"},{"id":"T800","span":{"begin":24624,"end":24629},"obj":"CHEBI_33250"},{"id":"T801","span":{"begin":24701,"end":24706},"obj":"CHEBI_33250"},{"id":"T802","span":{"begin":24740,"end":24745},"obj":"CHEBI_33250"},{"id":"T803","span":{"begin":24849,"end":24854},"obj":"CHEBI_33250"},{"id":"T804","span":{"begin":24922,"end":24926},"obj":"CHEBI_33250"},{"id":"T805","span":{"begin":24989,"end":24994},"obj":"CHEBI_33250"},{"id":"T806","span":{"begin":24540,"end":24544},"obj":"PATO_0001309"},{"id":"T807","span":{"begin":25580,"end":25586},"obj":"PATO_0001309"},{"id":"T808","span":{"begin":24540,"end":24544},"obj":"PATO_0000165"},{"id":"T809","span":{"begin":24718,"end":24726},"obj":"CHEBI_25367"},{"id":"T810","span":{"begin":24755,"end":24764},"obj":"CHEBI_25367"},{"id":"T811","span":{"begin":25087,"end":25092},"obj":"PATO_0000587"},{"id":"T812","span":{"begin":25191,"end":25199},"obj":"PATO_0000438"},{"id":"T813","span":{"begin":25323,"end":25331},"obj":"PATO_0000438"},{"id":"T814","span":{"begin":25200,"end":25205},"obj":"PATO_0000002"},{"id":"T815","span":{"begin":25332,"end":25337},"obj":"PATO_0000002"},{"id":"T816","span":{"begin":25211,"end":25222},"obj":"PATO_0000146"},{"id":"T817","span":{"begin":25306,"end":25317},"obj":"PATO_0000146"},{"id":"T818","span":{"begin":25455,"end":25459},"obj":"PATO_0000573"},{"id":"T819","span":{"begin":25507,"end":25514},"obj":"PATO_0000461"},{"id":"T820","span":{"begin":25637,"end":25645},"obj":"CHEBI_75958"},{"id":"T821","span":{"begin":26692,"end":26700},"obj":"CHEBI_75958"},{"id":"T822","span":{"begin":25646,"end":25653},"obj":"PATO_0001019"},{"id":"T823","span":{"begin":25984,"end":25992},"obj":"CHEBI_25367"},{"id":"T824","span":{"begin":26046,"end":26057},"obj":"CHEBI_33694"},{"id":"T825","span":{"begin":26062,"end":26065},"obj":"CHEBI_16991"},{"id":"T826","span":{"begin":26157,"end":26160},"obj":"CHEBI_16134"},{"id":"T827","span":{"begin":26492,"end":26495},"obj":"CHEBI_16134"},{"id":"T828","span":{"begin":26360,"end":26374},"obj":"CHEBI_30742"},{"id":"T829","span":{"begin":26544,"end":26550},"obj":"PATO_0001484"},{"id":"T830","span":{"begin":26565,"end":26570},"obj":"PATO_0001035"},{"id":"T831","span":{"begin":26627,"end":26630},"obj":"CHEBI_59759"},{"id":"T832","span":{"begin":26658,"end":26661},"obj":"CHEBI_59759"},{"id":"T833","span":{"begin":26796,"end":26799},"obj":"CHEBI_59759"},{"id":"T834","span":{"begin":26627,"end":26630},"obj":"CHEBI_59760"},{"id":"T835","span":{"begin":26658,"end":26661},"obj":"CHEBI_59760"},{"id":"T836","span":{"begin":26796,"end":26799},"obj":"CHEBI_59760"},{"id":"T837","span":{"begin":26627,"end":26630},"obj":"CHEBI_42383"},{"id":"T838","span":{"begin":26658,"end":26661},"obj":"CHEBI_42383"},{"id":"T839","span":{"begin":26796,"end":26799},"obj":"CHEBI_42383"},{"id":"T840","span":{"begin":26627,"end":26630},"obj":"CHEBI_34757"},{"id":"T841","span":{"begin":26658,"end":26661},"obj":"CHEBI_34757"},{"id":"T842","span":{"begin":26796,"end":26799},"obj":"CHEBI_34757"},{"id":"T843","span":{"begin":26754,"end":26761},"obj":"CHEBI_46787"},{"id":"T844","span":{"begin":26970,"end":26977},"obj":"CHEBI_46787"},{"id":"T845","span":{"begin":26875,"end":26881},"obj":"PATO_0001021"},{"id":"T846","span":{"begin":27095,"end":27099},"obj":"CHEBI_25016"},{"id":"T847","span":{"begin":27095,"end":27099},"obj":"CHEBI_27889"},{"id":"T882","span":{"begin":29135,"end":29141},"obj":"PATO_0001021"},{"id":"T883","span":{"begin":28743,"end":28752},"obj":"CHEBI_25367"},{"id":"T884","span":{"begin":29171,"end":29180},"obj":"CHEBI_25367"},{"id":"T885","span":{"begin":29046,"end":29055},"obj":"PATO_0001997"},{"id":"T886","span":{"begin":29405,"end":29412},"obj":"PATO_0000001"},{"id":"T887","span":{"begin":29547,"end":29555},"obj":"CHEBI_75958"},{"id":"T407","span":{"begin":1782,"end":1791},"obj":"PATO_0000141"},{"id":"T408","span":{"begin":1796,"end":1802},"obj":"PATO_0001021"},{"id":"T409","span":{"begin":2063,"end":2074},"obj":"CHEBI_33232"},{"id":"T410","span":{"begin":2095,"end":2100},"obj":"PATO_0000586"},{"id":"T411","span":{"begin":2385,"end":2393},"obj":"CHEBI_49637"},{"id":"T412","span":{"begin":2385,"end":2393},"obj":"CHEBI_18276"},{"id":"T848","span":{"begin":27191,"end":27198},"obj":"CHEBI_46787"},{"id":"T849","span":{"begin":27269,"end":27276},"obj":"CHEBI_46787"},{"id":"T850","span":{"begin":27368,"end":27375},"obj":"CHEBI_46787"},{"id":"T851","span":{"begin":27713,"end":27720},"obj":"CHEBI_46787"},{"id":"T852","span":{"begin":28285,"end":28292},"obj":"CHEBI_46787"},{"id":"T853","span":{"begin":27216,"end":27219},"obj":"CHEBI_59759"},{"id":"T854","span":{"begin":27776,"end":27779},"obj":"CHEBI_59759"},{"id":"T855","span":{"begin":28032,"end":28035},"obj":"CHEBI_59759"},{"id":"T856","span":{"begin":27216,"end":27219},"obj":"CHEBI_59760"},{"id":"T857","span":{"begin":27776,"end":27779},"obj":"CHEBI_59760"},{"id":"T858","span":{"begin":28032,"end":28035},"obj":"CHEBI_59760"},{"id":"T859","span":{"begin":27216,"end":27219},"obj":"CHEBI_34757"},{"id":"T860","span":{"begin":27776,"end":27779},"obj":"CHEBI_34757"},{"id":"T861","span":{"begin":28032,"end":28035},"obj":"CHEBI_34757"},{"id":"T862","span":{"begin":27216,"end":27219},"obj":"CHEBI_42383"},{"id":"T863","span":{"begin":27776,"end":27779},"obj":"CHEBI_42383"},{"id":"T864","span":{"begin":28032,"end":28035},"obj":"CHEBI_42383"},{"id":"T865","span":{"begin":27277,"end":27286},"obj":"CHEBI_25367"},{"id":"T866","span":{"begin":27721,"end":27730},"obj":"CHEBI_25367"},{"id":"T867","span":{"begin":28436,"end":28445},"obj":"CHEBI_25367"},{"id":"T868","span":{"begin":27362,"end":27367},"obj":"PATO_0000586"},{"id":"T869","span":{"begin":27405,"end":27414},"obj":"PATO_0000141"},{"id":"T870","span":{"begin":27511,"end":27519},"obj":"CHEBI_33250"},{"id":"T871","span":{"begin":27856,"end":27861},"obj":"CHEBI_33250"},{"id":"T872","span":{"begin":27564,"end":27569},"obj":"PATO_0001035"},{"id":"T873","span":{"begin":27701,"end":27712},"obj":"PATO_0001772"},{"id":"T874","span":{"begin":28121,"end":28131},"obj":"PATO_0000990"},{"id":"T875","span":{"begin":28175,"end":28183},"obj":"CHEBI_75958"},{"id":"T876","span":{"begin":28403,"end":28409},"obj":"PATO_0001021"},{"id":"T877","span":{"begin":28415,"end":28421},"obj":"PATO_0001021"},{"id":"T878","span":{"begin":28410,"end":28414},"obj":"PATO_0002316"},{"id":"T879","span":{"begin":28513,"end":28517},"obj":"PATO_0002316"},{"id":"T880","span":{"begin":28426,"end":28435},"obj":"PATO_0001986"},{"id":"T881","span":{"begin":28660,"end":28666},"obj":"PATO_0001021"}],"text":"2. Methodology\n\n2.1. Experimental Methods\nAs discussed above, the existence of an intramolecular H-bond was accepted in this review on the basis of gas-phase experiments. In a number of the studies the spectra were recorded for jet-cooled systems. References to these are provided when the gas-phase and in-solution molecular structures are compared in Section 3. The jet-cooling technique [23] allows reaching local temperatures as low as about 5 K, with the advantage that the molecules assume their vibrational ground states. Under these conditions, the vibrational spectrum becomes simpler and different conformers can be more easily identified and characterized. Microwave spectra were recorded at room temperature or somewhat below [24,25,26,27]. Gas-phase electron diffraction structure determinations, sometimes at two or three different temperatures, were performed in the range of 297–733 K [28,29,30,31].\nFor in-solution IR spectra, the temperature was generally room temperature or not far from it in the experiments surveyed here. In these cases, the boiling point of the solvent imposes a limit for the upper temperature. It is known that signals can split in NMR studies by lowering the temperature. The lowest operational temperature is constrained, however, by the freezing point of the NMR solvent [32].\nIn summary, the gas-phase structural parameters were obtained from experiments conducted in a very large temperature range of 5–733 K. The in-solution investigations were mainly performed near room temperature. Thus, comparisons of the structural data between gas-phase and in-solution experiments, as well as to theoretical calculations referring to 0 K, need caution.\n\n2.2. Geometry Optimization\n\n2.2.1. General Problems\nThe first step in a quantum-chemical structure and energy analysis is the optimization of the molecular geometry. If more than one structural form (different conformers, tautomers) are to be considered, each of them has to be optimized. A very important point is the selection of a reliable theoretical method and the application of a satisfactorily large basis set. Clearly, one wants to obtain the best computational results possible within the technical limits of the given structural problem. When a seeming H-bonded system is under scrutiny, an additional problem emerges in that the system has to be identified whether it is really a hydrogen-bonding arrangement or not.\nIn the review [22], a reference list was provided for the theoretical approaches and basis sets most frequently used in the past 15–20 years for geometry optimizations and relative energy calculations for ground state, closed-shell systems. Geometries were mainly optimized at the ab initio Hartree-Fock (HF) and second-order Møller-Plesset perturbation theory (MP2) levels or using some DFT-based (density functional theory) method such as B3LYP or some more recent ones that account for the dispersion interaction like the B97D method of Grimme [33] or the M05 and M06 methods of Zhao and Truhlar [34]. For basis sets, the 6-311++G** Pople basis or correlation consistent basis sets like cc-pvXz or aug-cc-pvXz (X = d, t, q) [35] have been applied more often in the most recent studies. Relative energies of conformers and tautomers from ab initio calculations are sensitive to the applied level of theory (method + basis set). Accounting for the electron correlations beyond the MP2 approach has turned out to be very effective by the application of the coupled-cluster methods, CCSD and CCSD(T) (coupled-cluster singles and doubles and noniterative triples) [36,37]. Hobza proposed an extrapolation formula for calculating the molecular energy at the CBS (complete basis set level) utilizing the MP2 limit energy and the difference of the CCSD or CCSD(T) energy and the MP2 energy calculated with some smaller basis set [38]. Frequency analysis can verify local energy minimum geometries by finding all positive vibrational frequencies. Using classical partition functions for ideal-gas molecules, the free energies can be estimated at some temperature T and pressure p [39].\nWhile many papers have proven that the results are very sensitive to the level of theory, calculations applying high-level theoretical methods in combination with large basis sets may not be practical even for the case of small molecules. This is true not only for individual geometric parameters and energy values where it is normal that energy decreases with a higher-level method and/or a larger basis set, but even for the relative energies between conformations. Changes in the relative values suggest that the energy differences have not reached a converged limit value yet. A disappointing example was presented by DePrince and Mazziotti [40], who compared two tautomers of the CH3NO molecule at the CCSD and CCSD(T) levels. Whereas the relative energies were calculated at 29.1 and 21.4 kJ/mol, respectively, utilizing the cc-pvdz basis set, the CBS values are 2.8 and −6.2 kJ/mol. These changes are dramatic. They indicate that the basis set effect on the relative energy is very large and the selected methods at the CBS limit even predict different relative stabilities. The CCSD(T)/CBS result is perhaps more reliable, yet it can not be ascertained in the absence of experimental information whether the obtained value is an acceptable limit or even higher-level methods should be considered. Note that this problem can be noticed even for a very small molecule. For a larger molecule (e.g., 15–20 heavy atoms and corresponding hydrogens) upgrading the level of theory is even less practical. The situation could be worse regarding the optimized geometries. For relative energies, we can ascertain at least that the computed limit is questionable while for geometries there is no clue about the correct bond lengths, angles, and torsion angles. If they do not vary monotonically in parallel with the increasing level of theory, one may have even less idea about the correct limit values in the absence of experimental information.\nWhy are the above, otherwise well-known computational experiences important in a review regarding H-bonds? As was shown above, the critical point can emerge after a small distortion of the optimized 1,2-dihydroxybenzene geometry. This finding can be interpreted to mean that the H-bond is disrupted in the optimal geometry. Indeed, it is quite possible that the existence of the BCP is very sensitive to the structure as can be seen by the earlier the quoted notes of Contreras-García et al. [11]. Thus, the suspicion may emerge that the level of theory is not high enough when small geometric changes can create or perturb a H-bond. Using again the paper of Mandado et al., [7] as an example, the (3, −1) BCP was found on the B3LYP/6-31+G** density map, but the BCP disappeared at the B3LYP/6-311++G** level, and did not appear either when the B3LYP/6-311++G (3d,3p) density was studied. The authors considered the B3LYP/6-31+G** result as an artifact and attributed it to the lack of diffuse functions on the hydrogen atoms. Disappearance of the BCP with larger basis sets clearly indicates, however, the basis set effect on some calculated topological indeces, and calls for studying a reliable electron density map. Obtainment of the latter is perhaps possible at a very high theoretical level, but such calculations are not practical for larger molecules. On the other hand, if gradually increasing basis sets provide different predictions with respect to the existence of a BCP, then conclusions based on medium-size basis sets remain uncertain.\nTheoretical calculations are able to predict a shift in the X–H vibrational frequency if the bond is involved H-bond interaction. Gu et al., [41] studied the possible intramolecular H-bond for α-hydroxy acetic acid. The authors concluded that the red-shift of about 100 cm−1, based on former experimental results for the stretching frequency of the α-hydroxy group relative to that for a free O–H bond in methanol “can be attributed to internal OH…O= hydrogen bonding”.\nThe red-shift in the case of an intramolecular H-bond was also demonstrated, at least qualitatively, by HF/6-31G* calculations for the 1,2-ethanediol [42,43]. For the all-trans-OCCO conformer tTt (C2h symmetry, for the three-letter code see [44]), where the two oxygens are far from each other, the two O–H frequencies were calculated equally at 4124 cm−1. For the most stable OCCO gauche conformation, tG+g− (C1 symmetry) allowing for an intramolecular hydrogen bond, the two O–H frequencies differed in accord with the experimental finding (see below). The frequencies for the gauche form were calculated at 4095 and 4123 cm−1. The smaller value refers to the O–H vibration involved in intramolecular O–H…O bonding. The larger frequency is related to the free OH vibration in the tG+g− conformation, where “t” indicates the trans HOCC arrangement. As a free OH, its stretching frequency is practically not affected and is equal to that for the tTt conformer. The O…O and O…H distances are 277 and 236 pm, respectively, well within the structural parameter set accepted for a H-bond. Although the calculated high frequencies are generally overestimated by about 10% at the HF level in comparison with experimental values, the shift of the frequency for the O–H group involved in the intramolecular interaction has revealed. A similar conclusion can be drawn for the 2-OH benzoic acid, when the calculated phenolic O–H frequency in intramolecular interaction with the carbonyl oxygen is compared with the free OH stretching frequency, as 3952 vs. 4112 cm−1 [45].\nFlorio et al., [46] compared the OH stretching frequencies of the monomeric and dimeric forms of formic and benzoic acids. The experimental values showed a red-shift of 459 cm−1 upon formic acid dimerization. The calculated harmonic frequency differences were 556 and 435 cm−1 at the B3LYP/aug-cc-pvtz and MP2/aug-cc-pvtz levels, respectively. For formic acid and its dimer, the geometry optimizations by the two methods led to very similar structure parameters, generally also close to the experimental values. The predicted frequencies at the MP2/aug-cc-pvtz level showed consistent overestimations for the monomer and the formic acid dimer (FAD), resulting in a red-shift close to the experimentally observed value. B3LYP/aug-cc-pvtz calculations provided, however, a larger overestimation for the monomer than for the dimer, leading to an increased red-shift. The authors concluded that the results “provide strong evidence that the B3LYP method does not provide a quantitatively correct description of this aspect of the H-bonding in the FAD dimer”.\nUpon the benzoic acid dimerization, the red-shift was 217 cm−1 experimentally as compared with the theoretical value of 616 cm−1 calculated at the B3LYP/6-311+G(2d,2p)/B3LYP/6-31+G(d) level. In this case the red-shift was even more strongly overestimated than that for FAD (see above) by the B3LYP method. These calculations utilized, however, a smaller basis set, B3LYP/6-311+G(2d,2p) for the carboxylic group and 6-31+G(d) for the atoms of the phenyl rings. For the benzoic acid systems, the calculated frequency was overestimated for the monomer and underestimated for the dimer. This latter result differs from that for the formic acid dimer. The calculated large red-shift for the benzoic acid systems may be explained by the interplay of the method and basis set. Since it was already qualified by the authors that the method does not quantitatively describe the H-bond for the dimer of a simple carboxylic acid, this likely applies to the benzoic acid dimer as well.\nIn intramolecular hydrogen bonds, the geometry for both the H…Y distance and the X–H…Y angle is primarily determined by the covalent structure of the molecule. While three-atom hydrogen-bonded rings are extremely rare, the four-atom substructures (e.g., carboxylic group, amides) deserve special consideration. In most cases, a H-bond can be expected if the system can form a five to seven-member ring, including arrangements utilizing a polar H. Prototypes are indicated in Figure 1 and typical representatives of five- and six-member rings are shown Figure 2 and Figure 3. Seven-member rings can be formed for γ-substituted carboxylic acids, 1,4-disubstituted butanes with OH and/or NH2 substituents.\nLarger rings are probably not stable. Chen et al., [47] pointed out that no intramolecular H-bond exists in the prevailing conformer of 1,5-pentadiol and 1,6-hexadiol at room temperature. In these cases, formation of an intramolecular H-bond would require a ring conformation with eight and nine members, respectively. This is probably unfavorable due to entropy considerations even for seven-member rings. Nagy et al., [48] studied different conformers for γ-hydroxy-butyric acid. Although the lowest-energy conformer optimized at the MP2/6-311++G** level formed a seven-member ring with an O–H…O= H-bond, the free energy for this structure is higher by about 2 kJ/mol than that for the most stable gas-phase species where this bond is disrupted, as also found experimentally [49]. The results for the increased relative free energy suggest unfavorable entropy effects for the hydrogen-bonded seven-member ring. This explanation is supported by the argument of Blanco et al. [50], who investigated the gas-phase structure of γ-amino-butyric acid (GABA). Intramolecular H-bonds were noticed in both forms of N–H…O=C and N…H–O–C=O, although the two mostly populated species do not possess an intramolecular H-bond. In order to create such bonds, structures have to be formed which “contribute to decrease entropy and to increase the Gibbs energy” [50].\n\n2.2.2. Special Problems\nIn general, for the past twenty years optimized molecular geometries in solution have been obtained by applying a continuum solvent model. The idea was introduced and subsequently developed by the Tomasi group as the PCM model [51,52]. Since the 1990’s, different continuum solvent models [53,54,55,56,57,58,59] and extension beyond the dielectric approximation [60] have been developed to account for the solvent effects on the geometry and energy/free energy of dissolved molecules. Several reviews summarize these models and compare the results obtained from different approaches [52,61,62,63].\nThe basic idea in the widely used PCM method [51] is that the solute is placed in a cavity carved in the continuum dielectric solvent, and the solute and the solvent mutually polarize each other. As a consequence, the solute’s geometry changes slightly and its internal energy increases when compared to its optimized gas-phase energy. The energy-increase is balanced by the developing solute-solvent electrostatic interaction energy. The final results are obtained through an iterative self-consistent-field (SCF) process that finds the total energy minimum and its related geometry. For the geometry optimization and energy/free energy calculations, all methods can be utilized, which were mentioned in relation to gas-phase calculations [33,34,35,36,37,38]. Thus, geometry optimizations can be performed by means of the HF, MP2 and DFT methods, and higher level energy calculations can be performed up to the CCSD(T)/CBS level. The customary basis set for geometry optimization and frequency analysis is 6-311++G**, but even the aug-cc-pvtz set has been applied [15,64].\nWhen a molecule dissolves, a close molecular environment is encountered that is in contrast to the most frequently applied ideal-gas model, where no potential energy interaction is considered even through the collisions of the molecules. Although the solute-solvent interactions are substantial, the effect of a non-polar or only slightly polar solvent (CCl4, CHCl3) on the molecular geometry is generally small [15,64]. The geometric effect could be, however, large when a solute with an intramolecular H-bond in the gas phase dissolves in a protic solvent such as water or methanol, which have both proton donor and acceptor sites. In this case, the X–H…Y intramolecular H-bond may collapse while solute-solvent H-bonds are formed using the free XH and Y sites.\nThe weakest point of the continuum dielectric solvent model is that the above solute-solvent H-bond(s) are only implicitly mimicked by polarization of the solvent and concomitant appearance of surface charges on the inner surface of the cavity: Negative surface charges opposite to a polar hydrogen and positive ones in the lone-pair regions of the solute’s oxygens and nitrogens. Although this response is qualitatively correct, the calculated solute-solvent stabilization energy is underestimated [65,66]. Thus, for proper calculation of the free energy changes when a polar solute with or without internal H-bond(s) dissolves in a protic solvent, explicit consideration of the solute-solvent intermolecular H-bonds becomes necessary.\nThis requirement can be largely satisfied by adopting the supermolecule + continuum approach, where the solute is surrounded by a number of explicit solvent molecules. The solute and the explicit solvent molecules mimic the H-bonds in the first solvation shell within the cavity carved in the continuum solvent. The critical question then becomes, how many explicit solvent molecules are to be considered.\nFor constructing the starting geometry of a supermolecule, knowledge of microsolvated solute structures is very helpful. In these systems, the central, polar molecule with or without an intramolecular H-bond is solvated by a few solvent molecules. Locations of the solvent molecules (water, methanol) indicate the most preferable solvation sites of the solute with a hydrogen donor/acceptor solvent. Useful information can be obtained from experimental gas-phase hydration/solvation studies augmented with theoretical calculations [12,13,14,16,23,67,68,69,70] or specific theoretical calculations for hydrated amino acid side chains, nucleotid base and sugar models [71,72,73,74,75].\nRecent calculations proved [15,64] that application of at least the aug-cc-pvtz basis set is required for reliable estimation of the relative solute free energies. If the solute has 6–10 C, N, O atoms and connected hydrogens, 500 basis functions could easily be required. If such a solute has to be surrounded by at least 5–6 water molecules, the number of basis functions increases to about 1000. The number of basis functions could be somewhat reduced by considering the solvent molecules with a lower basis set, with, e.g., 6-31+G**. While the supermolecule + continuum approach can be useful theoretically, it suffers from several technical challenges.\n(1) The geometry optimization for a system with 500–1000 basis functions is very slow in solution. If one wants to prove the local-energy-minimum character of the supermolecule and calculate thermal corrections, very small remaining forces should be allowed only at the end of the optimization. It is almost unreachable for a number of systems (or only by the application of the very time-consuming analytical second-derivative methods), in cases when torsion or intermolecular vibration frequencies could be as low as a few cm−1.\n(2) The number of explicit solvent molecules to be considered can become be critical. In a real, dilute solution the solute is surrounded by solvent molecules all around. Except for the simplest modeling cases like the partial solvation of γ-amino-butyric acid with 2–5 water molecules [76] or consideration of 3–8 water molecules during the HOCl catalyzed tautomerization of β-cyclopentadione [77], a considerably larger number of water molecules is generally required for reasonable modeling of the solvation sphere even for a small organic molecule. An impressive example was provided by Lu [78], who optimized the geometry of the Al(H2O)63+·12 H2O hydrate at the B3LYP/6-31+G(d,p) level in a water continuum by the PCM method. The resulting structure was of nearly spherical symmetry, easily allowing for the formation of the water network. Consideration of eighteen solvent molecules was necessary for mimicking the first and second hydration shells in a dilute solution.\n(3) The results of Lu and coworkers call attention to the need for the supermolecule to reasonably mimic the immediate in-solution environment of the solute. With a relatively small number of explicit solvent molecules within the supermolecule (for example, 3–4 water molecules, originally each of them facing a polar site), the water-water interactions may dominate over the solute-water interactions. Instead of forming 3–4 solute-water hydrogen bonds, a water cluster is then formed on some side of the solute and the number of water-solute hydrogen bonds would be smaller than expected in a water box with hundreds of water molecules. A successful tetrahydrate model in a continuum solvent was developed by Nagy for the syn-anti transformation of the acetic acid carboxylic group [79], whereas three waters in hydrogen bonds to the solute were not enough for modeling the immediate solvation environment of the transition state for 2F-phenol [15].\n(4) In general, only the first solvation shell around the polar sites can be modeled. Moreover, even in these cases, the explicit-solvent/continuum interface suffers from neglecting the consideration of the solvent-solvent hydrogen bonds. For methanol or acetonitrile solvents, the problem is not dramatic since the polar site of the solvent molecules should point toward the solute while the methyl group is located mainly on the outer surface of the supermolecule. Then the first-sphere solvent molecules can create a non-polar surface toward its continuum representation. This is surely not the case for explicit water molecules and is likewise questionable for a solvent like acetic acid with two stericly separated polar sites.\n(5) Geometry optimization for a supermolecule leads to an overly ordered structure, which is not maintained due to thermal disordering in a real solution.\n(6) If one wants to study the structure of a dilute or moderately concentrated (1 molar) solution as well as solute dimerization, boxes of a large number of explicit solvent molecules should be considered. These studies typically then require Monte Carlo (MC) [80] or molecular dynamics (MD) [81] simulations.\nDuring MC calculations, the solution model is a large solvent box with hundreds of solvent molecules and one or a few solute molecules embedded in the solvent. Atoms are represented by point-like centers characterized with atomic charges and assigned van der Waals parameters. The interaction energy of the atoms in different molecules is calculated by pair potentials and the total energy is the sum of these pair-interaction energies. Macroscopic thermodynamic quantities are estimated by averaging the individual values calculated for a large number of consecutive configurations. A configuration means a specific geometric arrangement of the elements in the solution box. The method is a probability method, meaning that a new configuration with modified geometric arrangement of the elements is considered for the above averaging upon the probability of the acceptance of the total energy change. If the new configuration is rejected, the old one is considered one more time in the averaging process. The most frequently applied sampling procedures are the Metropolis procedure [82] or some suitable procedure [83,84], which can accelerate the convergence of the calculated averages for thermodynamic quantities or help to more quickly reach an equilibrium solution structure by applying a biased energy calculation and probability for the acceptance of a new configuration. The goal is to generate a series of configurations corresponding to the Boltzmann distribution. For constant temperature simulations, the temperature is a parameter of the expression determining the acceptance probability. After having generated the required set of configurations, the average energy, enthalpy, volume, etc., can be calculated as an arithmetic mean of the individual values obtained with each configuration. If a biasing sampling was used, the probability of the acceptance has to be corrected before calculating averages.\nThe MD simulation is a deterministic process. The solution box is established as described above, but the atomic masses are also considered. A force field is used for calculating the total energy of the system with a given geometric arrangement (with Cartesian coordinates for each atom) at a reference time “t”. The force field contains terms accounting for the energy contributions by atoms bound along a 1–2–3–4 path, as well as for interactions of more remote atoms within the molecule and with all atoms in other molecules. The system generally is not in energy minimum, thus there are forces acting on the atoms. Using the gradient of the total energy, the forces acting on each atom can be determined. Applying Newton’s law, the position of the atoms at t + Δt can be calculated by means of the determined instantaneous velocities. Δt must be small, generally being chosen between 0.5 and 2 fs. In the latter case, the X–H distances are kept at a constant value. The temperature is related to the sum of the atomic kinetic energies. The simplest way to keep the temperature at a constant value is by scaling the determined atomic velocities or by coupling a thermostat to the system. Letting the simulation run long enough, sometimes for tens of nanoseconds, the average of the thermodynamic quantities can be obtained for a simulation period, where some structural characteristics, e.g., the solution density, have reached an equilibrium value already. By examining the trajectories calculated for geometric parameters of the solute, structural changes can be followed.\nUsing intermolecular pair-potentials such as OPLS-AA [85,86], Amber [87] or CHARMM22 [88], the largest problem is the development of the relevant atomic charges for the molecule. For example, Amber was originally parameterized for biopolymers and DNA, and no special charge parameters were available for, e.g., the HO–CHx–CHx–Y (Y = OH, NH2, NH3+, x = 1, 2) substructures. Furthermore, the OH and Y charges (and the CHx values, as well) should be conformation dependent, since, e.g., there is an intramolecular H-bond for the tG+g− conformer of 1,2-ethanediol, which is missing in the tTt form (Figure 2). Also, conformation dependent charges have to be used with Y = NH2, and NH3+ for the OCCN gauche and trans structures [89]. Recent developers of force fields suggest using molecular electrostatic potential (MEP) fitted charges, where the MEP should be obtained for the in-solution optimized solute. Since the solute and the continuum solvent mutually polarize each other, the MEP obtained at the end of the SCF procedure for the geometry optimization and energy minimization, reflects the electrostatic potential of a polarized solute in a polarized solvent environment. Charge fitting is a working tool, although different fitting methods (ES [90], RESP, [91], CHELPG [92]) lead to different results. Nonetheless, the problem is again of how to optimize the solute in a solvent environment. The MEP for the supermolecule is not relevant, since the solvent molecules are not so strictly bound to the solute due to thermal disorientation in a large solvent box as would follow from the structure of an optimized supermolecule. Furthermore, there is generally some charge transfer between the elements of the supermolecule. Although the programs force the charge of the total supermolecule to zero or some +/− integer for ionic solutes, the individual charges for the solute and the surrounding solvent molecules generally differ from 0, +/−1 etc. Thus, the MEP- fitted charges for a supermolecule should not be directly accepted for the atoms of the solute and applied in the MC of MD intermolecular pair-potentials. A possibility is that the solute geometry is accepted from the optimized supermolecule, and the MEP is fitted for the pure solute in a single-point calculation. This is, however, not a consistent procedure. The author has not found a good solution for this problem when surveying the literature.\nDespite the listed potential problems, the continuum solvent approach has been one of the most frequently used theoretical methods for characterizing the geometry and the energy/free energy for dissolved molecules. Since chemical equilibria depend on relative rather than absolute free energies, the problems mentioned above may not emerge in every case so harshly, and the errors could be partially cancelled. For example, the energy minimization in the supermolecule approach leads to too tightly bound water molecules. In a model, where the thermal disordering effect is also taken into consideration (MC and MD), a more loosely bound first solvation shell is expected. Nonetheless, since relative energy data are to be compared for the supermolecules with different solute conformations, the error is probably decreased. Also a more or less cancelled error may be expected regarding the interaction energy between the explicit solvent molecules at the outer surface of the supermolecules and the continuum. In a study for a series of compounds, it is a good practice to compare the computational results with available experimental values. Unfortunately, however, good-quality experimental results generally not are available in the literature for equilibria, where a number of conformers have been detected in solution.\n\n2.3. Free Energy Calculations\nThe focus of this subsection is the free energy calculation for explicit solvent models. The continuum-solvent calculations characterized above can provide free energies for individual solutes in any conformation, protonation state and tautomeric form. The method is not well suited, however, to the problem under investigation in the present survey pertaining to the possible disruption of intramolecular H-bonds in protic solvents. Since calculation of relative free energies is satisfactory for finding the more stable conformation, a perturbation method can be utilized in simulations where large solvent boxes including a solute and explicit solvent molecules are considered. The perturbation procedure is based on the work of Zwanzig and Jorgensen [93,94] and is widely used as the “free energy perturbation (FEP) method”.\nWhen the FEP method is applied, the atoms of solute and solvent molecules are characterized as sets of point charges with assigned van der Waals parameters. Locations of the solute charges symbolize those for atomic nuclei in the molecule. The combined effect of the accepted net solute charges should reproduce the in-solution molecular electrostatic potential nearly within and out of the van der Waals surface. To achieve this, it is useful to obtain the values of the point charges and determine their locations for the involved conformers/tautomers of the solute by geometry optimizations followed by the fit of charges to the in-solution MEP. In the applications of FEP below, the geometry of the solute with an intramolecular H-bond is converted into another structure without this bond. The atomic charges are also converted gradually and simultaneously from the starting set to final set, characterizing the corresponding conformations. The free energy is a state function, thus even if the intermediate states do not exist physically (when, e.g., the proton annihilation/proton development path is traveled) the sum of the perturbed free energy increments are theoretically correct. The molar free energy increment in the “i”th step, ΔGi is calculated as −RT ln[exp(−(Epi − Eri)/RT)]av, where the average of the exponential expression has to be calculated through a long simulation. Eri and Epi and are the total energies of the system per mole in a given configuration with parameters applied for the reference and perturbed solutes, respectively. The FEP in this case is led through controlled conformations or states. The constancy of the charge and geometry parameters is maintained only for a perturbation step. The actual parameters for the reference and perturbed states can be determined by a linear transformation of their values between the starting and the end structures. For example, conformations with and without an intramolecular H-bond correspond to perturbation parameters λ = 0 and λ = 1, respectively. In order to keep the free energy increments below about 4 kJ/mol, Δλ may be as small as about one hundredth.\nThe models used for water as solvent are generally TIP3P or TIP4P [20]. Some rigid, united atom CHn (n = 1–3) models also are available for small organic solvents such as CCl4, CHCl3, CH2Cl2, CH3CN, CH3OH [95] when the BOSS program (Biochemical and Organic Simulation System) [96] is used for Monte Carlo simulations [97,98,99]. The solvent models were parameterized for producing good density and heat of vaporization for the neat solvent, thus the accepted model implicitly accounts for the solvent-solvent interactions, including mutual polarizations of these molecules. Recently, however, one can create solvent boxes with all-atom solvent models by using the OPLS-AA 12–6–1 force field parameters [85,86].\nFigure 4 The free energy perturbation (FEP) curves for the transformations of conformers with an intramolecular H-bond to structures without H-bonds. Shown are 1,2-ethanediol (1) to (2), salicylic acid (3) to (4), and β-alanine zwitterion (5) to (6) where structure numbering is taken from Figure 2. Figure 4 is an illustration of the course of three FEP curves for conformational changes of the same molecules schematically compared in Figure 2. It is clear that the FEP, provided as the percent transformation, is not necessarily monotonic. For the two neutral molecules the most favorable hydration can be expected at about 20% and 60% transformations between the starting and finishing local energy minimum structures. For the gauche zwitterionic β-alanine, the ionic sites are less open for hydration and the solvation free energy the less favorable (5 is the trans, 6 is the gauche conformation).\nIn MD simulations, the solute geometry can change under the forces acting on each solute atom. Determination of bond stretching, bond bending and torsion parameters for every special molecule with possible intramolecular H-bond(s) is a very time consuming procedure. On the other hand, application of some average reference geometric parameters, stretching, bending force constants, and acceptance of average torsion potentials for these rather special molecules may lead to unreliably distorted structures in a flexible solute model. Thus use of the rigid optimized geometry could be favored from energy point of view and the computer time must be shorter if the energies of the solution configurations are to be averaged for a model with all rigid elements as compared with a slower convergence in case of a flexible solute. This option is not available for MD simulations, where individual molecular mechanics parameterization is desirable for high quality simulations. In some applications of the software, the lengths of the σ-bonds with hydrogen can be kept at fixed values although the problem of the critical torsion flexibility still remains.\nA further problem related to the use of the flexible solute model is that the MEP-fitted-charges characterize only the given optimized geometry. If the solute leaves this structure even temporarily, the charges are not relevant for the new geometry. Common in most MD and MC programs used in the past twenty years, however, is that the atomic point charges do not change through the simulation. In MD simulations with flexible solute geometry, the atomic charges are kept constant even if the structure changes among quite different conformational states.\nThis problem may be overcome by the use of the fluctuating charge (FC) model [100,101]. This approach introduces a polarizable force field, where the atomic point charges are allowed to fluctuate in response to the environment. Accordingly, this approach can account for the conformation dependence of the charges through the calculation of the solute-solvent interaction energies relevant to the actual solute geometry. The computation time increases by only about 10%, thus the method is applicable for simulations of large systems [102,103]. Nonetheless, MD/FEP calculation using the FC model has not been found even for small molecules in the surveyed literature.\nThe present review concentrates on results obtained with the described methods, although more recent simulation programs allow consideration of polarization charges on the solute and induced dipoles on the solvent [104]. The QM/MM (quantum mechanics/molecular mechanics) method [105,106] corresponds to the state-of-the-art level, but no article has been found where the QM/MM procedure was used for resolving the problem addressed in the title of this paper. The Car-Parinello molecular dynamics procedure (CPMD) [107] is another high-level theoretical method that could be applied as a state-of the-art procedure. The method applies pseudopotentials and the plane-wave basis set with periodic boundary conditions. The primary advantage of CPMD in comparison with ab initio molecular dynamics methods is that by introducing the extended variable Lagrangian formalism, CPMD can avoid the time demanding self-consistent matrix diagonalization at every step in the trajectory. Despite the attractive features of the method, the treatment of the electronic structure allows the application of CPMD only for systems that are remarkably smaller than those which can be easily considered in classical simulations. Although the possible disruption of the intramolecular H-bond in solution has been investigated only for small solutes below, the need to consider several hundred solvent molecules must have prevented applying CPMD because no such study has been found in the literature search.\n\n2.4. Dimeric Solutes\nThe issue to be considered in this section is similar to those that have been discussed above. Solutes with two polar sites for the monomer may form one or even two hydrogen bond(s) within a dimer, which is/are intramolecular from the perspective of this species. Then the created intramolecular H-bond(s) compete(s) with the intermolecular H-bonds between the monomeric solutes in the dissociated form and the solvent molecules. Specifically, monomers with HX–C=O (X = N, O) and N–C–OH substructures would belong to this category. Formation of an intramolecular H-bond in a four-member ring with two polar functionals would lead to a strained structure. Although covalently bound four-member rings exist, a moderately strong H-bond could not maintain this relationship. B97D/aug-cc-pvtz geometry optimization [79] found geometry parameters for the gas-phase acetic acid carboxylic group very close to experimental values [28]. Accordingly, the H…O= distance is 230 pm and the O–H…O= angle is 75.5°. This bond angle is quite far from the favorable O–H…O angle of about 170° generally found to be favorable intermolecular H-bond. Thus the OCOH moiety should not be considered to be a ring, and the better way to stabilize the HX–C=O or N–C–OH substructure is the formation of a dimer.\nThe gas-phase structure of formic acid was determined from the microwave spectrum by Lerner et al. [108]. The main geometric parameters were obtained by their fit to the rotational constants. However, formic acid assumes mostly a dimeric structure (FAD) in the gas phase. In a theoretical study, Turi [109] identified seven stable dimeric structures on the potential energy hypersurface. The doubly-hydrogen-bonded isomer of C2h symmetry was found to be the most stable arrangement. Experiments confirmed this theoretical prediction. A comprehensive discussion of the formic acid dimer related issues, the paradigm of symmetric hydrogen bonding, and a collection of former experimental papers were provided by Zielke and Suhm [110]. Rotationally resolved spectra were recorded under supersonic jet conditions for the FAD by Matylitsky et al. [111]. With the assumption of unperturbed monomers, a center-of-mass distance of R = 299.0 ± 0.1 pm for the monomers within the dimer was calculated from the spectroscopic results. A recent update of the experimental results on FAD augmented with theoretical calculations up to the MP2/aug-cc-pvtz level was provided by Balabin [112].\nGeometric parameters for monomeric and dimeric acetic and propionic acids in the gas phase were experimentally determined by Derissen [28,113]. Structures of the short-chain carboxylic acids to be discussed below are shown in Figure 5.\nFigure 5 Structures of the syn (11) and anti (12) acetic acid, the s-cis propenic acid (13) and the s-trans pyruvic acid (14). The carboxylic group is syn for the latter two. Simple aliphatic carboxylic acids assume predominantly doubly H-bonded, symmetrical dimeric forms (52%–87%) in the gas phase. The intermolecular H-bonds for acetic acid–acetic acid methyl ester complexes were studied by Emmeluth and Suhm [114] by FT-IR (Fourier transform infrared) spectroscopy for the mixed clusters in supersonic jet expansions. The methyl ester has two hydrogen-bond acceptor sites without having a strong donor site. The structural issues for the closest analogues of the acetic acid dimer and the acid-ester complexes were discussed on the basis of the recorded spectra and theoretical calculations at the MP2/6-311++G** and B3LYP/6-31G+G* levels. An interesting question is then: Will the dimeric forms be maintained in aqueous solution, or will the stabilizing “intramolecular” hydrogen bonds become disrupted so that the monomers can form intermolecular H-bonds with the water molecules.\nAnother important problem is the protonation state of the elements in the complexes of short-chain acids and bases. Such hydrogen-bonded complexes are stably formed from a neutral acid and a neutral amine in the gas phase, although acid—Guanidine complexes in the form of a hydrogen-bonded ion-pair are also stable in the gas phase [115]. In aqueous solution, the acid-amine ion-pair complexes are more stable, whereas the hydrogen-bonded complexes of the neutral elements are more stable in a low-polarity environment [116]. Such studies are important for the exploration of the ligand-receptor interactions in the binding cavity of the muscarinic acetylcholine receptor. At physiological pH of 7.4, a ligand with an amine function must be mainly protonated in the aqueous environment characteristic for the extracellular side of a transmembrane protein. When the ligand penetrates into the protein, most of the surrounding water molecules must be left behind. In the binding cavity of the receptor, at about 1100 pm from the surface, the ligand interacts with an aspartic acid side chain. According to a general consensus, the side chain is originally deprotonated. Thus a theoretical study of the protonation states for the partners under the modeling condition is justified [116].\nThe studies for dimer/complex formations in solution start with the geometry optimization of the associated species in the selected solvent. Using the continuum solvent approach, the poor handling of the solute-solvent H-bonds is less dramatic for the dimers of aliphatic acids. The main H-bond forming sites are involved in bonds to the partner, thus a less polar molecular surface of the dimer is seen by the solvent within the cavity. An acetic acid—Alkyl amine (mainly trimethyl amine) complex exhibits even larger non-polar molecular surface. The polar sites interact with each other in the depth of the complex, irrespective of whether the H-bond formed between the constituents is neutral or ionic.\nThe possible dimerization/complex formation in solution can be followed by calculating the potential of mean force (pmf) curve. This curve reflects the change of the solution free energy as a function of the solutes’ separation, taking the free energy of the solution with largely separated solutes as the reference state. If the pmf is calculated by the FEP procedure, the “R” separation of two reference atoms of the solutes should change only slightly by, e.g., 20 pm, and the free energy at separations R ± 10 pm could be calculated as a perturbation. Local minima of a pmf indicate stably associated forms, local maxima indicate barrier tops for association. The pmf may have more than one minimum site, also could decrease monotonically from the large-solute-separation reference state until reaching the minimum. The latter shape of the pmf indicates solute association without activation free energy. At small solute separations generally below 350 pm, the curves steeply ascend due to the quickly increasing van der Waals repulsions.\nFor large, more than 500–600 pm separation of the reference atoms, acceptance of the monomeric geometry and the related net atomic charges is reasonable. When the formation of the intermolecular H-bond(s) may start between the solutes, the geometry and the charge parameters should be gradually transformed into the values that were obtained for the optimized dimer. The related charges were fitted to the corresponding in-solution MEP [79,117].\n"}