Attacking Quantum Models with AI: When Can Truncated Neural Networks Deliver Results?

Currently, computing technologies are rapidly evolving and reshaping how we imagine the future. Quantum computing is taking its first toddling steps toward delivering practical results that promise unprecedented abilities. Meanwhile, artificial intelligence remains in public conversation as it’s used for everything from writing business emails to generating bespoke images or songs from text prompts to producing deep fakes.

Some physicists are exploring the opportunities that arise when the power of machine learning—a widely used approach in AI research—is brought to bear on quantum physics. Machine learning may accelerate quantum research and provide insights into quantum technologies, and quantum phenomena present formidable challenges that researchers can use to test the bounds of machine learning.

When studying quantum physics or its applications (including the development of quantum computers), researchers often rely on a detailed description of many interacting quantum particles. But the very features that make quantum computing potentially powerful also make quantum systems difficult to describe using current computers. In some instances, machine learning has produced descriptions that capture the most significant features of quantum systems while ignoring less relevant details—efficiently providing useful approximations.An artistic rendering of a neural network consisting of two layers. The top layer represents a real collection of quantum particles, like atoms in an optical lattice. The connections with the hidden neurons below account for the particles’ interactions. (Credit: Modified from original artwork created by E. Edwards/JQI)An artistic rendering of a neural network consisting of two layers. The top layer represents a real collection of quantum particles, like atoms in an optical lattice. The connections with the hidden neurons below account for the particles’ interactions. (Credit: Modified from original artwork created by E. Edwards/JQI)

In a paper published May 20, 2024, in the journal Physical Review Research, two researchers at JQI presented new mathematical tools that will help researchers use machine learning to study quantum physics. And using these tools, they have identified new opportunities in quantum research where machine learning can be applied.

“I want to understand the limit of using traditional classical machine learning tools to understand quantum systems,” says JQI graduate student Ruizhi Pan, who was the first author of the paper.

The standard tool for describing collections of quantum particles is the wavefunction, which provides a complete description of the quantum state of the particles. But obtaining the wavefunction for more than a handful of particles tends to require impractical amounts of time and resources.

Researchers have previously shown that AI can approximate some families of quantum wavefunctions using fewer resources. In particular, physicists, including CMTC Director and JQI Fellow Sankar Das Sarma, have studied how to represent quantum states using neural networks—a common machine learning approach in which webs of connections handle information in ways reminiscent of the neurons firing in a living brain. Artificial neural networks are made of nodes—sometimes called artificial neurons—and connections of various strengths between them.

Today, neural networks take many forms and are applied to diverse applications. Some neural networks analyze data, like inspecting the individual pixels of a picture to tell if it contains a person, while others model a process, like generating a natural-sounding sequence of words given a prompt or selecting moves in a game of chess. The webs of connections formed in neural networks have proven useful at capturing hard-to-identify relationships, patterns and interactions in data and models, including the unique interactions of quantum particles described by wavefunctions.

But neural networks aren’t a magic solution to every situation or even to approximating every wavefunction. Sometimes, to deliver useful results, the network would have to be too big and complex to practically implement. Researchers need a strong theoretical foundation to understand when they are useful and under what circumstances they fall prey to errors.

In the new paper, Pan and JQI Fellow Charles Clark investigated a type of neural network called a restricted Boltzmann machine (RBM), in which the nodes are split into two layers and connections are only allowed between nodes in different layers. One layer is called the visible, or input, layer, and the second is called the hidden layer, since researchers generally don’t directly manipulate or interpret it as much as they do the visible layer.

“The restricted Boltzmann machine is a concept that is derived from theoretical studies of classical ‘spin glass’ systems that are models of disordered magnets,” Clark says. “In the 1980s, Geoffrey Hinton and others applied them to the training of artificial neutral networks, which are now widely used in artificial intelligence. Ruizhi had the idea of using RBMs to study quantum spin systems, and it turned out to be remarkably fruitful.”

For RBM models of quantum systems, physicists frequently use each node of the visible layer to represent a quantum particle, like an individual atom, and use the connections made through the hidden layer to capture the interactions between those particles. As the size and complexity of quantum states grow, a neural net increasingly needs more and more hidden nodes to keep up, eventually becoming unwieldy.

However, the exact relationships between the complexity of a quantum state, the number of hidden nodes used in a neural network, and the resulting accuracy of the approximation are difficult to pin down. This lack of clarity is an example of the black box problem that permeates the field of machine learning. It exists because researchers don’t meticulously engineer the intricate web of a neural network but instead rely on repeated steps of trial and error to find connections that work. This approach often delivers more accurate or efficient results than researchers know how to achieve by working from first principles, but it doesn’t explain why the connections that make up the neural network deliver the desired result—so the results might as well have come from a black box. This built-in inscrutability makes it difficult for physicists to know which quantum models are practical to tackle with neural networks.

Pan and Clark decided to peek behind the veil of the hidden layer and investigate how neural networks boil down the essence of quantum wavefunctions. To do this, they focused on neural network models of a one-dimensional line of quantum spins. A spin is like a little magnetic arrow that wants to point along a magnetic field and is key to understanding how magnets, superconductors and most quantum computers function.

Spins naturally interact by pushing and pulling on each other. Through chains of interactions, even two distant spins can become correlated—meaning that observing one spin also provides information about the other spin. All the correlations between particles tend to drive quantum states into unmanageable complexity. 

Pan and Clark did something that at first glance might not seem relevant to the real world: They imagined and analyzed a neural network that uses infinitely many hidden nodes to model a fixed number of spins.

“In reality of course we don't hope to use a neural network with an infinitely large system size,” Pan says. “We often want to use finite size neural networks to do the numerical computations, so we need to analyze the effects of doing truncations.”

Pan and Clark already knew that using more hidden nodes generally produced more accurate results, but the research community only had a fuzzy understanding of how the accuracy suffers when fewer hidden nodes are used. By backing up and getting a view of the infinite case, Pan and Clark were able to describe the hypothetical, perfectly accurate representation and observe the contributions made by the infinite addition of hidden nodes. The nodes don’t all contribute equally. Some capture the basics of significant features, while many contribute small corrections.

The pair developed a method that sorts the hidden nodes into groups based on how much correlation they capture between spins. Based on this approach, Pan and Clark developed mathematical tools for researchers to use when developing, comparing and interpreting neural networks. With their new perspective and tools, Pan and Clark identified and analyzed the forms of errors they expect to arise from truncating a neural network, and they identified theoretical limits on how big the errors can get in various circumstances. 

In previous work, physicists generally relied on restricting the number of connections allowed for each hidden node to keep the complexity of the neural network in check. This in turn generally limited the reach of interactions between particles that could be modeled—earning the resulting collection of states the name short-range RBM states.

Pan and Clark’s work revealed a chance to apply RBMs outside of those restrictions. They defined a new group of states, called long-range-fast-decay RBM states, that have less strict conditions on hidden node connections but that still often remain accurate and practical to implement. The looser restrictions on the hidden node connections allow a neural network to represent a greater variety of spin states, including ones with interactions stretching farther between particles.

“There are only a few exactly solvable models of quantum spin systems, and their computational complexity grows exponentially with the number of spins,” says Clark. “It is essential to find ways to reduce that complexity. Remarkably, Ruizhi discovered a new class of such systems that are efficiently attacked by RBMs. It’s the old hero-returns-home story: from classical spin glass came the RBM, which grew up among neural networks, and returned home with a gift of order to quantum spin systems.”

The pair’s analysis also suggests that their new tools can be adapted to work for more than just one-dimensional chains of spins, including particles arranged in two or three dimensions. The authors say these insights can help physicists explore the divide between states that are easy to model using RBMs and those that are impractical. The new tools may also guide researchers to be more efficient at pruning a network’s size to save time and resources. Pan says he hopes to further explore the implications of their theoretical framework.

“I'm very happy that I realized my goal of building our research results on a solid mathematical basis,” Pan says. “I'm very excited that I found such a research field which is of great prospect and in which there are also many unknown problems to be solved in the near future.”

Original story by Bailey Bedford: https://jqi.umd.edu/news/attacking-quantum-models-ai-when-can-truncated-neural-networks-deliver-results

US Joins FCC Effort: Maryland’s Impact

On April 26, 2024, a joint “Statement of Intent between the United States of America and the European Organization for Nuclear Research (CERN) concerning Future Planning for Large Research Infrastructure Facilities, Advanced Scientific Computing, and Open Science” was signed at The White House.  The US-CERN SOI was signed by Deirdre Mulligan, The White House Principal Deputy Chief Technology Officer, and Fabiola Gianotti, the CERN Director-General.   Among other topics, the SOI expresses an intention by the United States to collaborate on a future FCC Higgs Factory, the “Future Circular Collider”, should the CERN Member States determine the project feasible. The planned FCC.The planned FCC

University of Maryland Professor Sarah Eno has played a leading role in establishing US participation in the physics and detectors of the FCC.  Appointed by CERN in 2020 as one of two US representative to the “physics, detector, and experiments” executive committee (with Dmitri Denisov of Brookhaven National Laboratory) Eno spearheaded physics input to the decadal planning process for particle physics, known as the P5 process.  The resulting white paper summarized the exciting physics potential of this facility.  In the resulting P5 report,  US participation in an offshore Higgs factory was recommended.  Recently Eno presented the status of US involvement at the FCC workshop in Annecy, France.  With this announcement, the US will start its formal participation in the development of this international facility.  

Photo from the signing showing from left-to-right: Abid Patwa (DOE), Chris Marcum (The White House OMB and Open Science Point), Deidre Mulligan (The White House PDCTO), Fabiola Gianotti (CERN DG), Rahima Kandahari (US State Department Deputy Assistant Secretary for Science, Technology, and Space Affairs), and Saul Gonzalez (NSF).Photo from the signing showing from left-to-right: Abid Patwa (DOE), Chris Marcum (The White House OMB and Open Science Point), Deidre Mulligan (The White House PDCTO), Fabiola Gianotti (CERN DG), Rahima Kandahari (US State Department Deputy Assistant Secretary for Science, Technology, and Space Affairs), and Saul Gonzalez (NSF).The FCC is planned to be a circular particle accelerator with a circumference around 91 km.  In its first phase, it would collide electrons and positrons with center-of-mass energies and beam intensities that allow collection of the entire sample of the previous LEP electron-positron collider’s Z bosons in three minutes, as well as large samples of W bosons, top quarks, and Higgs particles. Civil construction could begin in the mid 2030s, with data taking in the 2040s.   The accelerator would be located around Geneva, Switzerland, in a tunnel passing near the Jura mountains and under Lake Geneva.  As part of its studies of the Higgs boson, the FCC will study potential connections between it and dark matter, and search for influence of new massive particles and other new physics on its decay properties.  In the future, the same tunnel could house a proton-proton collider similar to the LHC, but with a center-of-mass energy seven times higher.

Maryland has had an impactful participation in the effort.  Besides Eno’s participation in the PED executive committee,  UMD is the lead institution in a plan for a new type of electromagnetic calorimeter for the FCC detectors.  Assistant Professor Chris Palmer has also involved undergraduate students taking PHYS441  in studies of its potential physics impact, which were presented at the second annual US FCC meeting at MIT.  

See the Statement of Intent here and the U.S. Department of State announcement here.

Attendees at the 7th FCC physics workshop in Annecy France (https://indico.cern.ch/event/1307378/), including Professor Sarah Eno. Click for high-resolution photo.Attendees at the 7th FCC physics workshop in Annecy France (https://indico.cern.ch/event/1307378/), including Professor Sarah Eno

IceCube Observes Seven Astrophysical Tau Neutrino Candidates

Neutrinos are tiny, weakly interacting subatomic particles that can travel astronomical distances undisturbed. As such, they can be traced back to their sources, revealing the mysteries surrounding the cosmos. High-energy neutrinos that originate from the farthest reaches beyond our galaxy are called astrophysical neutrinos and are the main subject of study for the IceCube Neutrino Observatory, a cubic-kilometer-sized neutrino telescope at the South Pole. In 2013, IceCube presented its first evidence of high-energy astrophysical neutrinos originating from cosmic accelerators, beginning a new era in astronomy. 

These cosmic messengers come in three different flavors: electron, muon, and tau, with astrophysical tau neutrinos being exceptionally difficult to pin down. Now, in a new study recently accepted as an “Editors’ Suggestion” by Physical Review Letters, the IceCube Collaboration presents the discovery of the once-elusive astrophysical tau neutrinos, a new kind of astrophysical messenger. 

IceCube detects neutrinos using cables (strings) of digital optical modules (DOMs), with a total of 5,160 DOMs embedded deep within the Antarctic ice. When neutrinos interact with molecules in the ice, charged particles are produced that then emit blue light while traveling through the ice, which is then registered and digitized by the individual DOMs. The light produces distinctive patterns, one of which is double cascade events from high-energy tau neutrino interactions within the detector.

The production of a double pulse waveform. The photons from a neutrino interaction (blue) arrive at the top middle DOM at time tI, producing the first peak in the waveform, while photons from the tau lepton decay (purple) arrive at the same DOM at time tD, producing the second peak. Credit: Jack Pairin/IceCube CollaborationThe production of a double pulse waveform. The photons from a neutrino interaction (blue) arrive at the top middle DOM at time tI, producing the first peak in the waveform, while photons from the tau lepton decay (purple) arrive at the same DOM at time tD, producing the second peak. Credit: Jack Pairin/IceCube Collaboration

Since prior IceCube analyses saw hints from searches for subtle signatures produced by astrophysical tau neutrinos, the researchers remained motivated to pinpoint tau neutrinos. After rendering each event into three images (see figure below), they trained convolutional neural networks (CNNs) optimized for image classification to distinguish images produced by tau neutrinos from images produced by various backgrounds. After having simulations run that confirmed its sensitivity to tau neutrinos, the technique was then applied to 10 years of IceCube data acquired between 2011 and 2020. The result was seven strong candidate tau neutrino events. 

“The detection of seven candidate tau neutrino events in the data, combined with the very low amount of expected background, allows us to claim that it is highly unlikely that backgrounds are conspiring to produce seven tau neutrino imposters,” said Doug Cowen, a professor of physics at Penn State University and one of the study leads. “The discovery of astrophysical tau neutrinos also provides a strong confirmation of IceCube’s earlier discovery of the diffuse astrophysical neutrino flux.”

Candidate astrophysical tau neutrino detected on November 13, 2019. Each column corresponds to one of the three neighboring strings of the selected event. Each figure in the top row shows the DOM number, proportional to the depth, versus the time of the digitized PMT signal in 3-ns bins, with the bin color corresponding to the size of the signal in each time bin, for each of the three strings. The total number of photons detected by each string is provided at the upper left in each figure. In the most-illuminated string (left column), the arrival of light from two cascades is visible as two distinct hyperbolas. The bottom row of figures shows the “saliency” for one of the CNNs for each of the three strings. The saliency shows where changes in light level have the greatest impact on the value of the CNN score. The black line superimposed on the saliency plots shows where the light level goes to zero and is effectively an outline of the figures in the top row. The saliency is largest at the leading and trailing edges of the light emitted by the two tau neutrino cascades, showing that the CNN is mainly sensitive to the overall structure of the event. Credit: IceCube CollaborationCandidate astrophysical tau neutrino detected on November 13, 2019. Each column corresponds to one of the three neighboring strings of the selected event. Each figure in the top row shows the DOM number, proportional to the depth, versus the time of the digitized PMT signal in 3-ns bins, with the bin color corresponding to the size of the signal in each time bin, for each of the three strings. The total number of photons detected by each string is provided at the upper left in each figure. In the most-illuminated string (left column), the arrival of light from two cascades is visible as two distinct hyperbolas. The bottom row of figures shows the “saliency” for one of the CNNs for each of the three strings. The saliency shows where changes in light level have the greatest impact on the value of the CNN score. The black line superimposed on the saliency plots shows where the light level goes to zero and is effectively an outline of the figures in the top row. The saliency is largest at the leading and trailing edges of the light emitted by the two tau neutrino cascades, showing that the CNN is mainly sensitive to the overall structure of the event. Credit: IceCube CollaborationCowen added that the probability of the background mimicking the signal was estimated to be less than one in 3.5 million. 

UMD Research Scientist Erik Blaufuss served as an internal reviewer for the analysis, carefully studying the methods and techniques used to make the discovery. Assistant Professor Brian Clark leads the scientific working group in IceCube that produced the result. The IceCube collaboration includes several UMD faculty, including  Kara Hoffman, Greg Sullivan, and Michael Larson, in addition to several graduate students and postdocs. The UMD group plays a leading role in the maintenance and operations of the detector, as well as the simulation and analysis of the data. 

Future analyses will incorporate more of IceCube’s strings, since this study used just three of them. The new analysis would increase the sample of tau neutrinos that can then be used to perform the first three-flavor study of neutrino oscillations—the phenomenon where neutrinos change flavors—over cosmological distances. This type of study could address questions such as the mechanism of neutrino production from astrophysical sources and the properties of space through which neutrinos travel. 

Currently, there is no tool specifically designed to determine the energy and direction of tau neutrinos that produce the signatures seen in this analysis. Such an algorithm could be used to better differentiate a potential tau neutrino signal from background and to help identify candidate tau neutrinos in real time at the South Pole. Similar to current IceCube real-time alerts issued for other neutrino types, alerts for tau neutrinos could be issued to the astronomical community for follow-up studies.

All in all, this exciting discovery comes with the “intriguing possibility of leveraging tau neutrinos to uncover new physics,” said Cowen. 

+ info “Observation of Seven Astrophysical Tau Neutrino Candidates with IceCube,” The IceCube Collaboration: R. Abbasi et al. Accepted by Physical Review Letters. arxiv.org/abs/2403.02516

Original story by 

Ana Maria Rey to Speak at Graduate Commencement Ceremony

For Ana Maria Rey (Ph.D. ’04, physics), the path to a highly successful career as a theoretical physicist and researcher began more than three decades ago in her home country of Colombia, with an inspiring high school physics teacher, the brilliance of Isaac Newton and her own boundless curiosity.

Ana Maria Rey.  Courtesy of same.Ana Maria Rey. Courtesy of same. Ana Maria Rey. Photo courtesy of same. Click image to download hi-res version.

I had a physics teacher in high school, and he was amazing, he taught me about Newton’s laws of motion,” Rey recalled. “I was so excited that I could write an equation and predict the behavior of objects that I kept asking him to give me books so I could keep working and solve more problems because it was all so interesting to me.”

A few years later, after Rey earned her bachelor’s degree in physics in Colombia and began her Ph.D. in physics at the University of Maryland, her future as a scientist began to come into sharper focus. Thanks to a game-changing connection with Distinguished University Professor of Physics and Nobel laureate William Phillips, Rey charted a course toward breakthrough research in atomic and molecular physics and laid the groundwork for a fruitful collaboration with the National Institute of Standards and Technology (NIST), one that is still going strong today.

“For me, NIST and the University of Maryland are just blended. I can’t separate them because they’re so connected,” Rey explained. “UMD gave me a strong foundation, all my research experience, the collaboration experience, everything I learned about how I should talk to experimentalists was thanks to the University of Maryland/NIST partnership. My research was going on at NIST but this only could happen because I was at UMD, so I think when I decided to go to the University of Maryland it was one of the best decisions I have ever made.”

Currently an adjunct professor at the University of Colorado Boulder, Rey has been a NIST Fellow since 2017 and a Fellow of JILA, the joint physics institute of CU Boulder and NIST, since 2012. She has earned a host of prestigious awards including a MacArthur Fellowship and the 2014 Presidential Early Career Award for Scientists and Engineers. In 2019, Rey became the first Hispanic woman to win the Blavatnik National Award for Young Scientists, and in 2023 she was elected to the National Academy of Sciences, one of the highest professional honors for a scientist.   

A big, big honor

In May 2024, nearly 20 years after earning her Ph.D., Rey will return to Maryland—a place that is still very close to her heart—to deliver the keynote speech at the Graduate Commencement Ceremony for UMD’s College of Computer, Mathematical, and Natural Sciences. She’s honored and humbled by the invitation.

“It’s a big, big honor. For me it was really special to feel that a university that has been so important in my career, to determining who I am, has asked me to give a speech like this,” Rey said. “I feel like inspiring students is one of my biggest roles and that’s why I find so touching the possibility to give the commencement speech because maybe this is a way I can tell them how I feel and try to encourage them to make the most of their future.”

A Nobel Prize-winning inspiration

When Rey began her graduate work at UMD she planned to pursue research in plasma physics—that is, until she attended an inspiring lecture by Phillips about his pioneering work with atoms and lasers.

“I heard Bill’s talk about how he was cooling atoms with light, and I found it fascinating. He was fantastic.  I approached my plasma physics advisor Adil Hasam afterward and I told him, ‘I feel that this is the direction that I want to pursue’ and he was totally supportive,” Rey recalled. “He encouraged me to reach out to Charles Clark, who at that time was the chief of the electron optical physics division at NIST and that is what I did.  Charles was very welcoming and told me that I could start working on ultra-cold atoms trapped in periodic potentials using lasers. That is how my Ph.D. adventure started.”

From then on, Rey’s research efforts really took off. At UMD and NIST and later at JILA, her work has focused on atomic, molecular and optical physics as well as condensed matter physics and quantum information science, setting the stage for one of her proudest achievements—her contribution to the most accurate atomic clock ever created.

“My goal is to try to understand the deepest secrets of the universe and try to use them for something useful,” Rey explained. “Understanding the collisions with these atoms has allowed us to create a clock that is really one of the best timekeepers that we have ever been able to construct, and we can now predict that they offer many other, unique possibilities.”

“You need to be excited”

A leading researcher in the Quantum Systems Accelerator, Rey has published more than 200 papers. And, after more than two decades of research, she’s still as motivated and excited about her work as she was the day it all began.

“As a scientist, you have to work a lot to make progress, so you absolutely need to be excited,” Rey explained. “I love it. Every day I’m surrounded by so many exciting experiments and so many things that I need to learn and understand. And every time that I learn something new, it really makes my day.”

Rey hopes she can share that excitement when she speaks to students and their families at the CMNS Graduate Commencement Ceremony in May. Her goal is to inspire the next generation of scientists to create success stories of their own.

“I would like to serve as a role model the way others have done for me,” Rey said. “If I am able to inspire new generations to become physicists, to advance science and do better, that’s one of my great ambitions, and it would be a great honor to feel that I’m doing that. So, my message to them is you have now in your hands the possibility to make a change in the world, so use all the knowledge that you’ve acquired to make that happen.“