High-resolution, high bandwidth data
Unknown molecule discovery
Unified encodings
Data compression
Mixture separation
$J, K_a, \lambda, F, \omega, +$
$A,B,C \rightarrow \nu, I$
Machine learning as an attractive method for data processing;
Models are only as good as how the data is represented!
Frequency vs. intensity may be intuitive for humans, but not for machines.
Energy nodes connected through transition nodes
Energy nodes connected through transition edges
Symmetry properties
Non-uniformity
Inductive bias
Learning on multiple scales: from local neighborhoods to graphs
10,000 rigid rotor spectra from SPCAT
Uniform sampling in $\kappa$ with scale invariant $A,B,C$
$E, J, K_a, K_c$ embedded per node
Can process 230 graphs per second on Nvidia 3070
83,000 nodes, ~330,000 edges per batch (32 full graphs)
Aggregates information from up to K neighbors per node
$\rightarrow A, B, C$
What do graph neural networks learn from spectroscopic graphs?
Use Uniform Manifold Approximation and Projection (UMAP)
Topology preserving 2D projection of high dimensional embeddings
Local patterns
Large scale patterns
Analysis of prolate, oblate, and asymmetric top topology
Graph layout differs with asymmetry: sparsity and boundaries
Topology of node embeddings contain energy information
Topology of node embeddings contain quantum number information
Topology of node embeddings contain neighborhood information
2000 graphs from validation set
Graph embeddings contain asymmetry information
How accurate/precise are the models in reproducing spectroscopic parameters and linkage?
Dependent variables for spectroscopic parameter estimation
Simple A/B testing indicates >95% ROC AUC score (i.e. correct linkage prediction 95% of the time)
Energy levels are incredibly sparse—not the true error!
Need to improve on edge training sample scheme
Applying graph principles toward automating spectral analysis
Graph neural networks able to learn information-rich node and graph embeddings
Linkage prediction is far from accurate—need to revise training strategy
Applying machine learning to molecular spectra
Use graph representations of rotational spectra
Graph/node embeddings successfully capture spectroscopic intuition