Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Kavli Affiliate: Wei Gao | First 5 Authors: Kai Huang, Hanyun Yin, Heng Huang, Wei Gao, | Summary: Fine-tuning is the most effective way of adapting pre-trained large language models (LLMs) to downstream applications. With the fast growth of LLM-enabled AI applications and democratization of open-souced LLMs, fine-tuning has become possible for non-expert individuals, but […]


Continue.. Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Kavli Affiliate: Wei Gao | First 5 Authors: Kai Huang, Hanyun Yin, Heng Huang, Wei Gao, | Summary: Fine-tuning is the most effective way of adapting pre-trained large language models (LLMs) to downstream applications. With the fast growth of LLM-enabled AI applications and democratization of open-souced LLMs, fine-tuning has become possible for non-expert individuals, but […]


Continue.. Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Deep Learning with Photonic Neural Cellular Automata

Kavli Affiliate: Alireza Marandi | First 5 Authors: Gordon H. Y. Li, Christian R. Leefmans, James Williams, Robert M. Gray, Midya Parto | Summary: Rapid advancements in deep learning over the past decade have fueled an insatiable demand for efficient and scalable hardware. Photonics offers a promising solution by leveraging the unique properties of light. […]


Continue.. Deep Learning with Photonic Neural Cellular Automata

Exact coherent structures in two-dimensional turbulence identified with convolutional autoencoders

Kavli Affiliate: Michael P. Brenner | First 5 Authors: Jacob Page, Joe Holey, Michael P. Brenner, Rich R. Kerswell, | Summary: Convolutional autoencoders are used to deconstruct the changing dynamics of two-dimensional Kolmogorov flow as $Re$ is increased from weakly chaotic flow at $Re=40$ to a chaotic state dominated by a domain-filling vortex pair at […]


Continue.. Exact coherent structures in two-dimensional turbulence identified with convolutional autoencoders

mixed attention auto encoder for multi-class industrial anomaly detection

Kavli Affiliate: Feng Wang | First 5 Authors: Jiangqi Liu, Feng Wang, , , | Summary: Most existing methods for unsupervised industrial anomaly detection train a separate model for each object category. This kind of approach can easily capture the category-specific feature distributions, but results in high storage cost and low training efficiency. In this […]


Continue.. mixed attention auto encoder for multi-class industrial anomaly detection

ICM-SHOX. Paper I: Methodology overview and discovery of a baryon–dark matter velocity decoupling in the MACS J0018.5+1626 merger

Kavli Affiliate: Sunil Golwala | First 5 Authors: Emily M. Silich, Elena Bellomi, Jack Sayers, John ZuHone, Urmila Chadayammuri | Summary: Galaxy cluster mergers are rich sources of information to test cluster astrophysics and cosmology. However, cluster mergers produce complex projected signals that are difficult to interpret physically from individual observational probes. Multi-probe constraints on […]


Continue.. ICM-SHOX. Paper I: Methodology overview and discovery of a baryon–dark matter velocity decoupling in the MACS J0018.5+1626 merger

ICM-SHOX. Paper I: Methodology overview and discovery of a gas–dark matter velocity decoupling in the MACS J0018.5+1626 merger

Kavli Affiliate: Sunil Golwala | First 5 Authors: Emily M. Silich, Elena Bellomi, Jack Sayers, John ZuHone, Urmila Chadayammuri | Summary: Galaxy cluster mergers are rich sources of information to test cluster astrophysics and cosmology. However, cluster mergers produce complex projected signals that are difficult to interpret physically from individual observational probes. Multi-probe constraints on […]


Continue.. ICM-SHOX. Paper I: Methodology overview and discovery of a gas–dark matter velocity decoupling in the MACS J0018.5+1626 merger

Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Kavli Affiliate: Dan Luo | First 5 Authors: Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu | Summary: Zero-shot text-to-speech (TTS) synthesis aims to clone any unseen speaker’s voice without adaptation parameters. By quantizing speech waveform into discrete acoustic tokens and modeling these tokens with the language model, recent language model-based TTS models […]


Continue.. Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Kavli Affiliate: Dan Luo | First 5 Authors: Shun Lei, Yixuan Zhou, Liyang Chen, Dan Luo, Zhiyong Wu | Summary: Zero-shot text-to-speech (TTS) synthesis aims to clone any unseen speaker’s voice without adaptation parameters. By quantizing speech waveform into discrete acoustic tokens and modeling these tokens with the language model, recent language model-based TTS models […]


Continue.. Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts

Reducing hardware requirements for entanglement distribution via joint hardware-protocol optimization

Kavli Affiliate: Stephanie Wehner | First 5 Authors: AdriĆ  Labay-Mora, Francisco Ferreira da Silva, Stephanie Wehner, , | Summary: We conduct a numerical investigation of fiber-based entanglement distribution over distances of up to 1600km using a chain of processing-node quantum repeaters. We determine minimal hardware requirements while simultaneously optimizing over protocols for entanglement generation and […]


Continue.. Reducing hardware requirements for entanglement distribution via joint hardware-protocol optimization