Artificial intelligence and machine learning are currently affecting our lives in many small but impactful ways. For example, AI and machine learning applications recommend entertainment we might enjoy through streaming services such as Netflix and Spotify.
In the near future, it's predicted that these technologies will have an even larger impact on society through activities such as driving fully autonomous vehicles, enabling complex scientific research and facilitating medical discoveries.
But the computers used for AI and machine learning demand a lot of energy. Currently, the need for computing power related to these technologies is doubling roughly every three to four months. And cloud computing data centers used by AI and machine learning applications worldwide are already devouring more electrical power per year than some small countries. It's easy to see that this level of energy consumption is unsustainable.
A research team led by the University of Washington has developed new optical computing hardware for AI and machine learning that is faster and much more energy efficient than conventional electronics. The research also addresses another challenge — the 'noise' inherent to optical computing that can interfere with computing precision.
In a new paper, published Jan. 21 in Science Advances, the team demonstrates an optical computing system for AI and machine learning that not only mitigates this noise but actually uses some of it as input to help enhance the creative output of the artificial neural network within the system.
"We've built an optical computer that is faster than a conventional digital computer," said lead author Changming Wu, a UW doctoral student in electrical and computer engineering. "And also, this optical computer can create new things based on random inputs generated from the optical noise that most researchers tried to evade."
Optical computing noise essentially comes from stray light particles, or photons, that originate from the operation of lasers within the device and background thermal radiation. To target noise, the researchers connected their optical computing core to a special type of machine learning network, called a Generative Adversarial Network.
The team tested several noise mitigation techniques, which included using some of the noise generated by the optical computing core to serve as random inputs for the GAN.
For example, the team assigned the GAN the task of learning how to handwrite the number "7" like a person would. The optical computer could not simply print out the number according to a prescribed font. It had to learn the task much like a child would, by looking at visual samples of handwriting and practicing until it could write the number correctly. Of course the optical computer didn't have a human hand for writing, so its form of "handwriting" was to generate digital images that had a style similar to the samples it had studied, but were not identical to them.
"Instead of training the network to read handwritten numbers, we trained the network to learn to write numbers, mimicking visual samples of handwriting that it was trained on," said senior author Mo Li, a UW professor of electrical and computer engineering. "We, with the help of our computer science collaborators at Duke University, also showed that the GAN can mitigate the negative impact of the optical computing hardware noises by using a training algorithm that is robust to errors and noises. More than that, the network actually uses the noises as random input that is needed to generate output instances."
After learning from handwritten samples of the number seven, which were from a standard AI-training image set, the GAN practiced writing "7" until it could do it successfully. Along the way, it developed its own distinct writing style and could write numbers from one to 10 in computer simulations.
The next steps include building this device at a larger scale using current semiconductor manufacturing technology. So, instead of constructing the next version of the device in a lab, the team plans to use an industrial semiconductor foundry to achieve wafer-scale technology. A larger-scale device will further improve performance and allow the research team to do more complex tasks beyond handwriting generation such as creating artwork and even videos.
"This optical system represents a computer hardware architecture that can enhance the creativity of artificial neural networks used in AI and machine learning, but more importantly, it demonstrates the viability for this system at a large scale where noise and errors can be mitigated and even harnessed," Li said. "AI applications are growing so fast that in the future, their energy consumption will be unsustainable. This technology has the potential to help reduce that energy consumption, making AI and machine learning environmentally sustainable — and very fast, achieving higher performance overall."
Additional co-authors are Ruoming Peng, a UW doctoral student in electrical and computer engineering; Xiaoxuan Yang, a doctoral student at Duke University; Heshan Yu, a research associate at University of Maryland, College Park; Ichiro Takeuchi, a professor at University of Maryland, College Park; and Yiran Chen, a professor at Duke University. This research was funded by the Office of Naval Research, the National Science Foundation and the Army Research Office. Part of this work was conducted at the Washington Nanofabrication Facility at the UW.