Hydrogen Bond Distances and Base Plane Angles in GC Base Pairing

For two RNA bases to be considered base-paired, they must form hydrogen bonds at close range and lie in roughly the same plane. In practice, a heavy-atom donor–acceptor distance below ~3.4 Å is used as a cutoff for a valid hydrogen bond between bases ( Tools for the automatic identification and classification of RNA base pairs - PMC ). At least one such H-bond (and preferably two or more) should link the G and C. Meanwhile, the angle between the base planes (the angle between their normal vectors) should be relatively small – typically under ~65° – so that the bases are near-coplanar ( Tools for the automatic identification and classification of RNA base pairs - PMC ). In your example, a G–C pair with H-bond distance <3.4 Å and a 46° inter-plane angle would satisfy these criteria: the bond length is within hydrogen-bonding range and the bases, though tilted, are still reasonably coplanar (46° is below common cutoff values). Such a pair is likely a genuine interaction rather than mere stacking. By contrast, if the bases were almost perpendicular (angle approaching 90°) or too offset (large vertical separation), the H-bond would be geometrically strained and the interaction might be considered invalid as a base pair ( Tools for the automatic identification and classification of RNA base pairs - PMC ) ((PDF) DSSR: An integrated software tool for dissecting the spatial structure of RNA). In summary, both a short H-bond and a fairly planar orientation are needed: one without the other is usually insufficient for a stable RNA base pair.

Leontis–Westhof Classification and Geometric Limits

Leontis–Westhof (L–W) classification defines RNA base pairs by the interacting edges of each base (Watson–Crick, Hoogsteen, or Sugar edge) and the orientation (cis or trans) of the glycosidic bonds (). Importantly, the original L–W scheme was intended for well-formed base pairs, typically those with at least two hydrogen bonds connecting the bases (). It assumes the bases interact edge-to-edge in a planar fashion. While Leontis and Westhof did not rigidly specify an angle cutoff in their 2001 paper, “planar edge-to-edge” implies the bases should be nearly coplanar. In practice, later implementations of L–W classification established clear geometric standards. For example, RNAview (Yang et al., 2003) – a tool that fully implements L–W classification – requires the base–base angle αb to be less than 65° for an interaction to count as a base pair ( Tools for the automatic identification and classification of RNA base pairs - PMC ). If the bases are tilted more than that, the interaction is no longer considered a standard base pair but something else (like a stacking or tertiary contact). Likewise, RNAview requires the base planes to be separated by <2.5 Å vertically ( Tools for the automatic identification and classification of RNA base pairs - PMC ), reinforcing that the bases must lie in essentially the same plane.

Crucially, L–W’s “12 basic geometric families” cover only those pairs with ≥2 H-bonds in roughly planar geometry. A 46° inter-base angle, for instance, is within the tolerated range and would not on its own disqualify a G–C pair from being classified (it’s below the ~65° cutoff). In fact, L–W classification includes many non-Watson–Crick G–C pairings (e.g. reverse G–C, Hoogsteen-type) that often have tilted geometry yet still form two robust H-bonds. The scheme’s scope does not typically extend to interactions with only one hydrogen bond, because those are considered either “bifurcated” pairs or tertiary contacts rather than one of the 12 standard families () ( Tools for the automatic identification and classification of RNA base pairs - PMC ). For borderline cases, there are stricter guidelines: Yang et al. note that if a base pair’s two H-bonds come from a single donor (a bifurcated bond), a tighter coplanarity cutoff of <50° is applied to accept it as one of the 12 families ( Tools for the automatic identification and classification of RNA base pairs - PMC ). This ensures only reasonably planar bifurcated pairs (which are inherently weaker) are included. Conversely, a near-zero angle with too large a separation is treated as mere stacking and rejected as a base pair (e.g. αb<10° but bases >2.2 Å apart vertically is considered stacking) ( Tools for the automatic identification and classification of RNA base pairs - PMC ). In summary, L–W classification is meant for bona fide base pairs with good geometry – typically two or more H-bonds and a planar alignment. While the original scheme didn’t dictate an exact angle value, tools that implement it consistently use an upper bound around 60–65° for the base-plane angle ( Tools for the automatic identification and classification of RNA base pairs - PMC ), thereby operationalizing the “planarity” concept.

Handling by Annotation Tools: MC-Annotate, RNAView, FR3D, and DSSR

Different RNA structure analysis tools employ these geometric criteria in slightly varied ways to detect and classify base pairs, especially non-canonical ones:

Despite minor differences, all these tools broadly agree on core principles. Notably, canonical Watson–Crick pairs (like standard G≡C) are recognized easily by all. Discrepancies arise mainly with non-canonical or borderline cases. For instance, one study showed that while these methods concur on ~80–90% of base pairs, their annotations of non-canonical pairs can vary a lot ([CompAnnotate: a comparative approach to annotate base-pairing interactions in RNA 3D structures Nucleic Acids Research Oxford Academic](https://academic.oup.com/nar/article/45/14/e136/3875524#:~:text=MC,with%20the%20annotation%20of%20benchmark)). MC-Annotate might report more single-H-bond pairs (due to its permissive approach) whereas RNAView might ignore those or mark them as tertiary, and DSSR might list them but classify as “other.” FR3D, being tuned for motif discovery, often emphasizes recurring valid interactions, possibly filtering out random one-off contacts. When formulating your own criteria, it’s wise to lean on the common ground that these tools have established: require at least one strong H-bond and a roughly planar alignment. You can then decide how strict to be (e.g. insist on two H-bonds for calling it a structured base pair, as L–W originally does, or allow one H-bond if you simply want to catalog any contact).

Standardizing Base Pair Criteria (Literature-Supported)

Drawing from the above, we can outline a universal rule set for identifying and classifying RNA base pairs, backed by authoritative sources:

By following these guidelines – drawn from authoritative literature and validated in software – you can confidently judge G–C base pair validity across all RNA structures. In essence, demand the presence of one or two good hydrogen bonds and near-planar geometry for base pairs ((PDF) DSSR: An integrated software tool for dissecting the spatial structure of RNA). The references above (Leontis & Westhof 2001; Major et al. 2002; Yang et al. 2003; Lu et al. 2015, among others) provide a strong foundation to justify these rules in any publication or analysis. Adhering to such standardized criteria will make your base-pair annotations consistent with community norms and tools, ensuring that your G–C pairs (and other base pairs) are identified and classified in line with established RNA structural biology practices ( Tools for the automatic identification and classification of RNA base pairs - PMC ) ( RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire - PMC ).

References: Key sources include Leontis & Westhof (2001) for the base pair classification framework (), Major et al. (2002) introducing MC-Annotate’s H-bond probability method ( RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire - PMC ), Yang et al. (2003) describing RNAview’s geometric criteria ( Tools for the automatic identification and classification of RNA base pairs - PMC ) ( Tools for the automatic identification and classification of RNA base pairs - PMC ), and Lu et al. (2015) detailing DSSR’s all-inclusive base-pair annotation approach ((PDF) DSSR: An integrated software tool for dissecting the spatial structure of RNA). These, along with additional analyses ( RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire - PMC ) ( Tools for the automatic identification and classification of RNA base pairs - PMC ), collectively underpin the recommended rules for RNA base pair validation. Each of the mentioned tools and studies converges on the principle that distance + planarity + H-bonds are the triad for base-pair identification, providing a well-founded, generalizable standard for all RNA structural analyses.