
Tech & Society
Think D̶i̶f̶f̶e̶r̶e̶n̶t̶ Small: Apple releases OpenELM, a family of smaller large language models.
Apple is thinking small — very small — with a new family of open large language models.
Tech & Society
How well do large language models respond to professional-level queries in various industry domains? A new company aims to find out.
Machine Learning Research
Retrieval-augmented generation (RAG) enables large language models to generate better output by retrieving documents that are relevant to a user’s prompt. Fine-tuning further improves RAG performance.
Tech & Society
Language models can generate code that erroneously points to software packages, creating vulnerabilities that attackers can exploit.
Machine Learning Research
Large language models sometimes generate false statements. New work makes them more likely to produce factual output.
Tech & Society
Microsoft took over most of the once high-flying chatbot startup Inflection AI in an unusual deal.
Machine Learning Research
Research aims to help users select large language models that minimize expenses while maintaining quality.
Tech & Society
Security researchers sounded the alarm about holes in Hugging Face’s platform.
Machine Learning Research
Large language models are not good at math. Researchers devised a way to make them better. Tiedong Liu and Bryan Kian Hsiang Low at the National University of Singapore proposed a method to fine-tune large language models for arithmetic tasks.
Tech & Society
Google asserted its open source bona fides with new models. Google released weights for Gemma-7B, an 8.5 billion-parameter large language model intended to run on GPUs, and Gemma-2B, a 2.5 billion-parameter version intended for deployment on CPUs and edge devices.
Tech & Society
European AI champion Mistral AI unveiled new large language models and formed an alliance with Microsoft.
Machine Learning Research
Language models equipped for retrieval-augmented generation can retrieve text from a database to improve their output. Further work extends this capability to retrieve information from any application that comes with an API.