Leanne Test Publish
When a new post is published, it should trigger a Vercel build (on staging).
Nvidia’s Nemotron adds reasoning to Llama models. Does ChatGPT make frequent users more lonely? OpenAI’s o1-pro costs a pretty penny. Mistral Small 3.1 gives Gemma 3 27B some competition.
Nvidia gives Project DIGITS a new name. AI models compete to build Minecraft items. Claude chatbot now includes search. A Moore’s law-like regularity for AI agents.
The Batch AI News and Insights: Last Friday, on Pi Day, we held AI Dev 25, a new conference for AI developers.
Materials that have specific properties are essential to progress in critical technologies like solar cells and batteries. A machine learning model designs new materials to order.
The United States Copyright Office determined that existing laws are sufficient to decide whether a given AI-generated work is protected by copyright, making additional legislation unnecessary.
An AI agent synthesizes novel scientific research hypotheses. It's already making an impact in biomedicine.
Multilingual AI models often suffer uneven performance across languages, especially in multimodal tasks. A pair of lean models counters this trend with consistent understanding of text and images across major languages.
Google’s two new Gemini vision-language-action robotics models. Cohere’s Command A, another lightweight LMM. New China regulations mandate labels for AI content. Monitoring reasoning models for reward hacking or unwanted behavior.
OpenAI’s new SDK and APIs for agentic workflows. Olympic Coder, two powerful open coding models. Alibaba applies RL to emotion detection. GPT-4.5 and Claude Sonnet 3.7 top a new agent leaderboard.
The Batch Newsletter
The Batch AI News and Insights: Some people today are discouraging others from learning programming on the grounds that AI will automate it.
Tech & Society
Large language models built by developers in China may, in some applications, be less useful outside that country because they avoid topics its government deems politically sensitive. A developer fine-tuned DeepSeek-R1 to widen its scope without degrading its overall performance.
Tech & Society
A United States court delivered a major ruling that begins to answer the question of whether, and under what conditions, training an AI system on copyrighted material is considered fair use that doesn’t require permission.
Machine Learning Research
Microsoft debuted its first official large language model that responds to spoken input.
Machine Learning Research
Most models that have learned to reason via reinforcement learning have been huge. A much smaller model now competes with them.
Data Points
Music and lyrics in one diffusion model. Manus AI’s impressive demos spark excitement and backlash. OpenAI sees AGI as a gradual evolution. Google unveils its first Gemini-branded embedding models.
Data Points
Cohere’s open vision models support many languages. Jamba 1.6’s two hybrid MoE models promise more speed. Anthropic overhauls its developer console for Claude Sonnet 3.7. Mistral brings its multilingual/multimedia skills to OCR.
Letters
Continuing our discussion on the Voice Stack, I’d like to explore an area that today’s voice-based systems mostly struggle with: Voice Activity Detection (VAD) and the turn-taking paradigm of communication.
Tech & Society
Amazon announced Alexa+, a major upgrade to its long-running voice assistant.
Machine Learning Research
Anthropic’s Claude 3.7 Sonnet implements a hybrid reasoning approach that lets users decide how much thinking they want the model to do before it renders a response.
Machine Learning Research
OpenAI launched GPT-4.5, which may be its last non-reasoning model.