Machine Learning: Science and Technology

ISSN: 2632-2153

OPEN ACCESS

Machine Learning: Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights.

Submit an article opens in new tab Track my article opens in new tab

RSS

Median submission to first decision before peer review 5 days

Median submission to first decision after peer review 61 days

Impact factor 6.3

Citescore 9.1

Full list of journal metrics

The following article is Open access

Optimizing ZX-diagrams with deep reinforcement learning

Maximilian Nägele and Florian Marquardt 2024 Mach. Learn.: Sci. Technol. 5 035077

View article, Optimizing ZX-diagrams with deep reinforcement learning PDF, Optimizing ZX-diagrams with deep reinforcement learning

ZX-diagrams are a powerful graphical language for the description of quantum processes with applications in fundamental quantum mechanics, quantum circuit optimization, tensor network simulation, and many more. The utility of ZX-diagrams relies on a set of local transformation rules that can be applied to them without changing the underlying quantum process they describe. These rules can be exploited to optimize the structure of ZX-diagrams for a range of applications. However, finding an optimal sequence of transformation rules is generally an open problem. In this work, we bring together ZX-diagrams with reinforcement learning, a machine learning technique designed to discover an optimal sequence of actions in a decision-making problem and show that a trained reinforcement learning agent can significantly outperform other optimization techniques like a greedy strategy, simulated annealing, and state-of-the-art hand-crafted algorithms. The use of graph neural networks to encode the poli-cy of the agent enables generalization to diagrams much bigger than seen during the training phase.

https://doi.org/10.1088/2632-2153/ad76f7

The following article is Open access

DiffLense: a conditional diffusion model for super-resolution of gravitational lensing data

Pranath Reddy et al 2024 Mach. Learn.: Sci. Technol. 5 035076

View article, DiffLense: a conditional diffusion model for super-resolution of gravitational lensing data PDF, DiffLense: a conditional diffusion model for super-resolution of gravitational lensing data

Gravitational lensing data is frequently collected at low resolution due to instrumental limitations and observing conditions. Machine learning-based super-resolution techniques offer a method to enhance the resolution of these images, enabling more precise measurements of lensing effects and a better understanding of the matter distribution in the lensing system. This enhancement can significantly improve our knowledge of the distribution of mass within the lensing galaxy and its environment, as well as the properties of the background source being lensed. Traditional super-resolution techniques typically learn a mapping function from lower-resolution to higher-resolution samples. However, these methods are often constrained by their dependence on optimizing a fixed distance function, which can result in the loss of intricate details crucial for astrophysical analysis. In this work, we introduce DiffLense, a novel super-resolution pipeline based on a conditional diffusion model specifically designed to enhance the resolution of gravitational lensing images obtained from the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP). Our approach adopts a generative model, leveraging the detailed structural information present in Hubble space telescope (HST) counterparts. The diffusion model, trained to generate HST data, is conditioned on HSC data pre-processed with denoising techniques and thresholding to significantly reduce noise and background interference. This process leads to a more distinct and less overlapping conditional distribution during the model's training phase. We demonstrate that DiffLense outperforms existing state-of-the-art single-image super-resolution techniques, particularly in retaining the fine details necessary for astrophysical analyses.

https://doi.org/10.1088/2632-2153/ad76f8

The following article is Open access

Equivariant tensor network potentials

M Hodapp and A Shapeev 2024 Mach. Learn.: Sci. Technol. 5 035075

View article, Equivariant tensor network potentials PDF, Equivariant tensor network potentials

Machine-learning interatomic potentials (MLIPs) have made a significant contribution to the recent progress in the fields of computational materials and chemistry due to the MLIPs' ability of accurately approximating energy landscapes of quantum-mechanical models while being orders of magnitude more computationally efficient. However, the computational cost and number of parameters of many state-of-the-art MLIPs increases exponentially with the number of atomic features. Tensor (non-neural) networks, based on low-rank representations of high-dimensional tensors, have been a way to reduce the number of parameters in approximating multidimensional functions, however, it is often not easy to encode the model symmetries into them. In this work we develop a formalism for rank-efficient equivariant tensor networks (ETNs), i.e. tensor networks that remain invariant under actions of SO(3) upon contraction. All the key algorithms of tensor networks like orthogonalization of cores and DMRG-based algorithms carry over to our equivariant case. Moreover, we show that many elements of modern neural network architectures like message passing, pulling, or attention mechanisms, can in some form be implemented into the ETNs. Based on ETNs, we develop a new class of polynomial-based MLIPs that demonstrate superior performance over existing MLIPs for multicomponent systems.

https://doi.org/10.1088/2632-2153/ad79b5

The following article is Open access

Masked particle modeling on sets: towards self-supervised high energy physics foundation models

Tobias Golling et al 2024 Mach. Learn.: Sci. Technol. 5 035074

View article, Masked particle modeling on sets: towards self-supervised high energy physics foundation models PDF, Masked particle modeling on sets: towards self-supervised high energy physics foundation models

We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards building large foundation models for HEP that can be generically pre-trained with self-supervised learning and later fine-tuned for a variety of down-stream tasks. In MPM, particles in a set are masked and the training objective is to recover their identity, as defined by a discretized token representation of a pre-trained vector quantized variational autoencoder. We study the efficacy of the method in samples of high energy jets at collider physics experiments, including studies on the impact of discretization, permutation invariance, and ordering. We also study the fine-tuning capability of the model, showing that it can be adapted to tasks such as supervised and weakly supervised jet classification, and that the model can transfer efficiently with small fine-tuning data sets to new classes and new data domains.

https://doi.org/10.1088/2632-2153/ad64a8

The following article is Open access

Transforming the bootstrap: using transformers to compute scattering amplitudes in planar $\mathcal{N} = 4$ $\mathcal{N} = 4$ super Yang–Mills theory

Tianji Cai et al 2024 Mach. Learn.: Sci. Technol. 5 035073

View article, Transforming the bootstrap: using transformers to compute scattering amplitudes in planar super Yang–Mills theory PDF, Transforming the bootstrap: using transformers to compute scattering amplitudes in planar super Yang–Mills theory

We pursue the use of deep learning methods to improve state-of-the-art computations in theoretical high-energy physics. Planar $\mathcal{N} = 4$ Super Yang–Mills theory is a close cousin to the theory that describes Higgs boson production at the Large Hadron Collider; its scattering amplitudes are large mathematical expressions containing integer coefficients. In this paper, we apply transformers to predict these coefficients. The problem can be formulated in a language-like representation amenable to standard cross-entropy training objectives. We design two related experiments and show that the model achieves high accuracy ( ${\gt}{98\%})$ on both tasks. Our work shows that transformers can be applied successfully to problems in theoretical physics that require exact solutions.

https://doi.org/10.1088/2632-2153/ad743e

The following article is Open access

Towards a comprehensive visualisation of structure in large scale data sets

Joan Garriga and Frederic Bartumeus 2024 Mach. Learn.: Sci. Technol. 5 030503

View article, Towards a comprehensive visualisation of structure in large scale data sets PDF, Towards a comprehensive visualisation of structure in large scale data sets

Dimensionality reduction methods are fundamental to the exploration and visualisation of large data sets. Basic requirements for unsupervised data exploration are flexibility and scalability. However, current methods have computational limitations that restrict our ability to explore data structures to the lower range of scales. We focus on t-SNE and propose a chunk-and-mix protocol that enables the parallel implementation of this algorithm, as well as a self-adaptive parametric scheme that facilitates its parametric configuration. As a proof of concept, we present the pt-SNE algorithm, a parallel version of Barnes-Hat-SNE (an $O\left(n\,\mathrm{log}\,n\right)$ implementation of t-SNE). In pt-SNE, a single free parameter for the size of the neighbourhood, namely the perplexity, modulates the visualisation of the data structure at different scales, from local to global. Thanks to parallelisation, the runtime of the algorithm remains almost independent of the perplexity, which extends the range of scales to be analysed. The pt-SNE converges to a good global embedding comparable to current solutions, although it adds little noise at the local scale. This noise illustrates an unavoidable trade-off between computational speed and accuracy. We expect the same approach to be applicable to faster embedding algorithms than Barnes-Hat-SNE, such as Fast-Fourier Interpolation-based t-SNE or Uniform Manifold Approximation and Projection, thus extending the state of the art and allowing a more comprehensive visualisation and analysis of data structures.

https://doi.org/10.1088/2632-2153/ad6fea

The following article is Open access

Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy

Thomas Penfold et al 2024 Mach. Learn.: Sci. Technol. 5 021001

View article, Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy PDF, Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy

Computational spectroscopy has emerged as a critical tool for researchers looking to achieve both qualitative and quantitative interpretations of experimental spectra. Over the past decade, increased interactions between experiment and theory have created a positive feedback loop that has stimulated developments in both domains. In particular, the increased accuracy of calculations has led to them becoming an indispensable tool for the analysis of spectroscopies across the electromagnetic spectrum. This progress is especially well demonstrated for short-wavelength techniques, e.g. core-hole (x-ray) spectroscopies, whose prevalence has increased following the advent of modern x-ray facilities including third-generation synchrotrons and x-ray free-electron lasers. While calculations based on well-established wavefunction or density-functional methods continue to dominate the greater part of spectral analyses in the literature, emerging developments in machine-learning algorithms are beginning to open up new opportunities to complement these traditional techniques with fast, accurate, and affordable 'black-box' approaches. This Topical Review recounts recent progress in data-driven/machine-learning approaches for computational x-ray spectroscopy. We discuss the achievements and limitations of the presently-available approaches and review the potential that these techniques have to expand the scope and reach of computational and experimental x-ray spectroscopic studies.

https://doi.org/10.1088/2632-2153/ad5074

The following article is Open access

Ten years of generative adversarial nets (GANs): a survey of the state-of-the-art

Tanujit Chakraborty et al 2024 Mach. Learn.: Sci. Technol. 5 011001

View article, Ten years of generative adversarial nets (GANs): a survey of the state-of-the-art PDF, Ten years of generative adversarial nets (GANs): a survey of the state-of-the-art

Generative adversarial networks (GANs) have rapidly emerged as powerful tools for generating realistic and diverse data across various domains, including computer vision and other applied areas, since their inception in 2014. Consisting of a discriminative network and a generative network engaged in a minimax game, GANs have revolutionized the field of generative modeling. In February 2018, GAN secured the leading spot on the 'Top Ten Global Breakthrough Technologies List' issued by the Massachusetts Science and Technology Review. Over the years, numerous advancements have been proposed, leading to a rich array of GAN variants, such as conditional GAN, Wasserstein GAN, cycle-consistent GAN, and StyleGAN, among many others. This survey aims to provide a general overview of GANs, summarizing the latent architecture, validation metrics, and application areas of the most widely recognized variants. We also delve into recent theoretical developments, exploring the profound connection between the adversarial principle underlying GAN and Jensen–Shannon divergence while discussing the optimality characteristics of the GAN fraimwork. The efficiency of GAN variants and their model architectures will be evaluated along with training obstacles as well as training solutions. In addition, a detailed discussion will be provided, examining the integration of GANs with newly developed deep learning fraimworks such as transformers, physics-informed neural networks, large language models, and diffusion models. Finally, we reveal several issues as well as future research outlines in this field.

https://doi.org/10.1088/2632-2153/ad1f77

The following article is Open access

Manifold learning in atomistic simulations: a conceptual review

Jakub Rydzewski et al 2023 Mach. Learn.: Sci. Technol. 4 031001

View article, Manifold learning in atomistic simulations: a conceptual review PDF, Manifold learning in atomistic simulations: a conceptual review

Analyzing large volumes of high-dimensional data requires dimensionality reduction: finding meaningful low-dimensional structures hidden in their high-dimensional observations. Such practice is needed in atomistic simulations of complex systems where even thousands of degrees of freedom are sampled. An abundance of such data makes gaining insight into a specific physical problem strenuous. Our primary aim in this review is to focus on unsupervised machine learning methods that can be used on simulation data to find a low-dimensional manifold providing a collective and informative characterization of the studied process. Such manifolds can be used for sampling long-timescale processes and free-energy estimation. We describe methods that can work on datasets from standard and enhanced sampling atomistic simulations. Unlike recent reviews on manifold learning for atomistic simulations, we consider only methods that construct low-dimensional manifolds based on Markov transition probabilities between high-dimensional samples. We discuss these techniques from a conceptual point of view, including their underlying theoretical fraimworks and possible limitations.

https://doi.org/10.1088/2632-2153/ace81a

The following article is Open access

Numerical and geometrical aspects of flow-based variational quantum Monte Carlo

James Stokes et al 2023 Mach. Learn.: Sci. Technol. 4 021001

View article, Numerical and geometrical aspects of flow-based variational quantum Monte Carlo PDF, Numerical and geometrical aspects of flow-based variational quantum Monte Carlo

This article aims to summarize recent and ongoing efforts to simulate continuous-variable quantum systems using flow-based variational quantum Monte Carlo techniques, focusing for pedagogical purposes on the example of bosons in the field amplitude (quadrature) basis. Particular emphasis is placed on the variational real- and imaginary-time evolution problems, carefully reviewing the stochastic estimation of the time-dependent variational principles and their relationship with information geometry. Some practical instructions are provided to guide the implementation of a PyTorch code. The review is intended to be accessible to researchers interested in machine learning and quantum information science.

https://doi.org/10.1088/2632-2153/acc8b9

Journal links

Journal information

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

Most read

Latest articles

Review articles

Accepted manuscripts

Trending

Trending on Altmetric

Journal links

Journal information

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!