Biased Reaction Optimization

This design defines the replacement for the current Turbomole-dependent AFIR optimization path in react. Reaction optimization must be an independent PyAR service that can use xTB, machine-learning potentials, or quantum chemistry backends without assigning coordinate-update policy to a backend.

Current Problem

The compatibility route through pyar.backends.xtb_turbo evaluates an xTB gradient, adds the AFIR contribution, writes the modified energy and gradient into Turbomole files, and asks Turbomole to update the coordinates and check convergence. This has three design consequences:

an xTB/AFIR reaction calculation requires a Turbomole installation
optimization strategy is hidden inside a backend wrapper rather than owned by the reaction workflow
the current xtb_turbo and Turbomole AFIR implementations set gamma = 100.0 internally, instead of using the gamma schedule supplied by react

pyar.biases.afir.isotropic already provides the important reusable piece: additive AFIR energy and Cartesian force evaluation. The geomeTRIC adapter uses that force directly instead of coupling it to Turbomole file rewriting.

Target Objective

The reaction workflow constructs one objective from a physical energy/force provider and zero or more additive bias potentials:

\[E_\mathrm{total}(x) = E_\mathrm{method}(x) + E_\mathrm{AFIR}(x)\]

\[F_\mathrm{total}(x) = F_\mathrm{method}(x) + F_\mathrm{AFIR}(x)\]

The optimizer receives only this objective. The implemented providers are xTB, AIMNet2, ORCA, and Gaussian; other engines require energy/force adapters before they can use this channel. The AFIR gamma value belongs to the bias settings stored in the request and run state; it must never be replaced by a wrapper-level constant.

AFIR Bias

AFIR adds an artificial force term to the objective to drive fragments together or apart while the optimizer follows the combined energy landscape. In PyAR, the AFIR bias is an additive term on top of the backend energy and forces, so the workflow can explore reaction channels, weak complexes, and ligand-association events without giving the backend control over coordinate updates.

The implementation here follows the isotropic AFIR formulation described by Maeda et al. [AFIR2016].

Core Interfaces

The initial contracts should be small and testable:

@dataclass(frozen=True)
class EnergyGradientResult:
    energy_hartree: float
    gradient_hartree_per_bohr: np.ndarray

class EnergyGradientProvider(Protocol):
    def evaluate(
        self, molecule: Molecule, coordinates_bohr: np.ndarray
    ) -> EnergyGradientResult: ...

class BiasPotential(Protocol):
    def evaluate(
        self, molecule: Molecule, coordinates_bohr: np.ndarray
    ) -> EnergyGradientResult: ...

class GeometryOptimizer(Protocol):
    def minimize(
        self, objective: Objective, molecule: Molecule, settings: OptimizerSettings
    ) -> OptimizationResult: ...

Objective sums the provider and bias results, counts evaluations, logs physical and bias energies separately, and records all optimizer steps for restart and diagnosis.

The current registry wires this contract through backend-specific provider factories. The provider owns backend settings such as charge, multiplicity, and executable details; the objective only sees the result object above.

Units are part of this contract:

coordinates at the objective boundary: Bohr
energies: Hartree
Cartesian forces or gradients: Hartree/Bohr, with sign translated explicitly at the optimizer boundary
stored XYZ and user-facing geometry coordinates: Angstrom

Conversions should occur once at the objective boundary. This aligns the new objective with the units already expected by the current AFIR calculation.

Internal-Coordinate Optimizer Choice

Cartesian LBFGS and FIRE are useful baselines and fallbacks, but they do not satisfy the primary design requirement: efficient molecular optimization in internal coordinates. The implemented xTB, AIMNet2, ORCA, and Gaussian AFIR reaction channels use geomeTRIC with TRIC coordinates.

TRIC adds explicit translation and rotation coordinates for each molecular fragment alongside intrafragment internal coordinates. This matches an AFIR reaction workflow in which independently oriented fragments approach and rearrange while covalent bonding changes. GeomeTRIC also accepts a custom energy/Cartesian-gradient engine, using Hartree and Hartree/Bohr units, which matches the objective contract above.

Candidate implementations should be treated as follows:

Optimizer	Coordinate model	PyAR role
geomeTRIC/TRIC	Delocalized intrafragment internals plus explicit fragment translation and rotation coordinates	Proposed default for biased reaction minimization and the first integration target
PyBerny	Redundant internal coordinates with RFO, trust radius, and Hessian updates	Secondary Python benchmark for bonded molecular cases; not the default for fragment-association paths
DL-FIND	DLC and HDLC, plus Cartesian modes	Strong reference implementation and optional later integration, particularly for transition-state extensions
xTB ANCopt/L-ANCopt	Approximate normal coordinates internal to xTB	Native unbiased xTB benchmark; usable for AFIR only if an additive external-gradient integration is demonstrated
ASE LBFGS/FIRE	Cartesian steps, optionally preconditioned	Simple fallback and performance baseline; not the AFIR production default

ASE remains valuable as a calculator interoperability layer. For example, an AIMNet2 or other ASE-compatible potential can be adapted to the common energy/gradient provider. It should not force the reaction workflow to use a Cartesian ASE optimizer.

Proposed Package Boundary

The new core should be built under these logical boundaries before modules are physically moved:

biases/
  base.py             # BiasPotential protocol
  afir.py             # AFIR energy and gradient implementation
optimizers/
  base.py             # OptimizerSettings and GeometryOptimizer protocol
  objective.py        # Provider plus bias composition and logging
  geometric.py        # geomeTRIC/TRIC adapter
  berny.py            # optional benchmark adapter
  ase_cartesian.py    # LBFGS/FIRE fallback and benchmark adapter
  dlfind.py           # optional future integration
backends/
  xtb.py              # xTB energy/gradient provider and unbiased shortcut
  aimnet2.py          # ML energy/gradient provider
  ...
workflows/
  reaction.py         # gamma schedule, product tests, state, reporting

The intended execution path is:

reaction request
  -> trial geometry and gamma schedule
  -> provider = xTB/AIMNet2/QC energy and gradient
  -> objective = provider + AFIR(gamma)
  -> geomeTRIC/TRIC optimizer
  -> product identity analysis
  -> structured state and reports

Current Restart Foundation

Reaction calculations now persist versioned state in reaction/state.json and geometry snapshots under reaction/state/geometries/. The state records the numeric gamma schedule, pending and retained current-cycle geometries, completed jobs, products, and invocation settings. Resume is rejected if the new invocation changes scientific inputs or backend settings.

GeomeTRIC-backed AFIR runs also write a lightweight path trace under each job directory in reaction_trace/. The trace contains a JSONL record for every backend evaluation plus matching XYZ snapshots, and successful paths add a path_summary.csv file together with candidate_ts/ geometries for the highest backend energy, highest total energy, pre-product, and largest bond change points.

The trace-analysis output has the following structure:

reaction_trace/
  trace.jsonl
  steps/step_*.xyz
path_summary.csv
candidate_ts/
  highest_backend_energy.xyz
  highest_total_energy.xyz
  pre_product_geometry.xyz
  max_bond_change.xyz
  metadata.json
trace_plots/
  reaction_profile.png

backend_energy_hartree is the physical backend energy without AFIR. total_energy_hartree includes the AFIR bias and is the optimization objective followed by geomeTRIC. Relative energies in path_summary.csv are reported relative to the first recorded trace frame, not relative to the minimum along the trace. This is useful for following the energetic climb from the AFIR starting geometry. A later analysis layer may additionally report minimum-referenced profiles.

Candidate geometries receive a candidate_ts/metadata.json sidecar and a richer XYZ comment with the selected step index and energy context. Files in candidate_ts/ are candidate geometries for later path refinement. They are not confirmed transition states. Confirmation requires a separate NEB, string, dimer, or TS optimization and an IRC or downhill connectivity check. Trace recording is restart-safe: resumed runs append to the existing trace instead of deleting it, and successful products keep a compact reference to the generated trace-analysis outputs in the reaction state file.

Legacy jobs.pkl checkpoints are migrated only when their old gamma labels map unambiguously to the requested numeric schedule. This gives the later internal-coordinate optimizer an inspectable and atomic restart foundation.

Implemented Channel And Next Steps

For react calculations using --software xtb, --software aimnet_2, --software orca, or --software gaussian, PyAR now selects the geomeTRIC/TRIC optimizer and evaluates the physical energy/forces together with the AFIR force. In practice, ORCA and Gaussian require the corresponding executable and should be validated on the target installation. When a bonded candidate is found, product relaxation is repeated without AFIR bias (gamma=0.0). Native optimizers remain the default for aggregate and ordinary backend optimization workflows.

The remaining development work is to store optimizer step and objective component details in restart state, benchmark this channel on controlled reaction cases, add quantum-chemistry force-provider adapters, and implement transition-state optimization as a separate validated workflow.

Validation And Benchmarks

The default cannot be chosen solely from algorithm descriptions. The integration must report:

AFIR analytic gradient agreement with finite differences
final energy and product identity for each successful optimization
energy/gradient evaluation count, optimizer step count, wall time, and convergence reason
results under multiple gamma values and multiple initial orientations
restart equivalence after an interrupted optimization
failure behavior for disconnected fragments, failed backend evaluations, and non-convergent trials

The initial benchmark matrix should compare:

geomeTRIC/TRIC with xTB plus AFIR
PyBerny with the same objective where fragment handling permits
ASE LBFGS and FIRE with the same objective as Cartesian baselines
native xTB ANCopt for unbiased reference optimizations
DL-FIND/DLC or HDLC as a later reference if installation and integration are practical

Algorithm References

The implementation should cite the algorithm that is actually used:

L.-P. Wang and C. Song, Geometry optimization made simple with translation and rotation coordinates, Journal of Chemical Physics 144, 214108 (2016), DOI 10.1063/1.4952956. This defines TRIC and geomeTRIC.
J. Kastner, J. M. Carr, T. W. Keal, W. Thiel, A. Wander, and P. Sherwood, DL-FIND: An Open-Source Geometry Optimizer for Atomistic Simulations, Journal of Physical Chemistry A 113, 11856-11865 (2009), DOI 10.1021/jp9028968. This documents DLC/HDLC-capable optimizer infrastructure.
C. Peng, P. Y. Ayala, H. B. Schlegel, and J. M. Frisch, Using redundant internal coordinates to optimize equilibrium geometries and transition states, Journal of Computational Chemistry 17, 49-56 (1996), DOI 10.1002/(SICI)1096-987X(19960115)17:1<49::AID-JCC5>3.0.CO;2-0. This is a direct basis for redundant-internal-coordinate Berny-style approaches.

The new implementation should integrate and test an established internal-coordinate optimizer rather than reimplement the Pulay/Schlegel coordinate transformation machinery from scratch.

[AFIR2016]

Satoshi Maeda, Yu Harabuchi, Makito Takagi, Tetsuya Taketsugu, and Keiji Morokuma, Artificial Force Induced Reaction (AFIR) Method for Exploring Quantum Chemical Potential Energy Surfaces, Chemical Record 16(5), 2232-2248 (2016), DOI 10.1002/tcr.201600043.