
Tech & Society
Audio Generation Clear of Copyrights: Stability AI releases enhanced text-to-audio generator Stable Audio Open
Sonically minded developers gained a high-profile text-to-audio generator.
Machine Learning Research
A relatively small student LLM that learns to mimic a larger teacher model can perform nearly as well as the teacher while using much less computation. It can come even closer if the teacher also teaches reasoning techniques.
Machine Learning Research
Text excerpts used in retrieval-augmented generation (RAG) tend to be short. Researchers used summarization to pack more relevant context into the same amount of text.
Machine Learning Research
The latest text-to-image generators can alter images in response to a text prompt, but their outputs often don’t accurately reflect the text. They do better if, in addition to a prompt, they’re told the general type of alteration they’re expected to make.
Machine Learning Research
Brain-to-computer interfaces that enable users to control robots with their thoughts typically execute a single type of task such as reaching and grasping. Researchers designed a system that responds to a variety of intentions.
Machine Learning Research
It’s not necessary to activate all parts of a large language model to process a given input. Using only the necessary parts saves computation.
Letters
Inexpensive token generation and agentic workflows for large language models (LLMs) open up intriguing new possibilities for training LLMs on synthetic data...
Machine Learning Research
A new AI method directs scientists toward promising avenues of inquiry.
Machine Learning Research
Video diffusion provides a new basis for generating 3D models.
Machine Learning Research
Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.
Machine Learning Research
An architectural innovation improves upon transformers — up to 2 billion parameters, at least...
Machine Learning Research
Large language models sometimes generate false statements. New work makes them more likely to produce factual output.