IBM is entering a crowded and rapidly evolving market of small language models (SLMs), competing with offerings like Qwen3, ...
The proof-of-concept could pave the way for a new class of AI debuggers, making language models more reliable for business-critical applications.
The backbone breaker benchmark (b3) is a new evaluation tool created by the AISI, Check Point and Check Point subsidiary ...
(NASDAQ: CHKP), a pioneer and global leader of cyber security solutions, and Lakera, a world leading AI-native security platform for Agentic AI applications, with researchers from The UK AI Security ...
LiteLLM allows developers to integrate a diverse range of LLM models as if they were calling OpenAI’s API, with support for fallbacks, budgets, rate limits, and real-time monitoring of API calls. The ...
ETH Zurich and EPFL’s open-weight LLM offers a transparent alternative to black-box AI built on green compute and set for public release. Large language models (LLMs), which are neural networks that ...
Qwen 2.5 Coder/Max is currently the top open-source model for coding, with the highest HumanEval (~70–72%), LiveCodeBench (70.7), and Elo (2056) scores among open models. DeepSeek V3/Coder V2 remains ...
One way AI can improve on human work Computer scientists at UC Berkeley say that AI models show promise as a way to discover ...
In January, assumptions around AI were shaken up by DeepSeek, a small Chinese company that nobody had heard of. This week it was Switzerland’s turn to stir things up. Apertus (Latin for ‘open’) is a ...
I’ve been writing about the democratic future of large language models (LLMs). Will this tech turn out to be an inherently centralized, authoritarian technology like nuclear power, or a more ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. This article dives into the happens-before ...