Topic

Research

Research papers, benchmarks, and technical reports

11 articles

Abstract editorial illustration of an AI agent silhouette inside a glowing runtime security shield, navy and teal palette, no humans, no humanoid figures, no readable text.

Straiker raises $64M to secure the AI agent workforce

Straiker, the agentic security startup, has closed a $64M Series A led by Marathon to build discovery, pre-deployment testing, and runtime protection for the AI agent workforce.

AIntelligenceHubJune 29, 2026

Abstract recursive neural network loop with glowing blue and cyan data streams feeding back into a central luminous node on a deep navy background, representing AI training AI

AI Research

Andrej Karpathy Joins Anthropic to Use Claude to Train the Next Claude

Andrej Karpathy, OpenAI co-founder and former Tesla AI director, is joining Anthropic's pre-training team. His mandate: use Claude to accelerate the research that produces the next Claude.

AIntelligenceHubMay 20, 2026

Abstract AI interface showing flight options with one highlighted in amber, representing AI conflict of interest in sponsored recommendations

AI Research

AI Agents Favored $1,500 Sponsored Flights Over $500 Alternatives in a New Study

A Princeton and UW study tested 23 AI models with sponsor incentives. Eighteen of 23 recommended the expensive sponsored flight over cheaper options more than half the time.

AIntelligenceHubMay 17, 2026

Glowing digital document fragmenting into data particles against a dark blue background, representing AI-driven document corruption during long-running tasks

AI Research

AI Agents Corrupt Your Documents During Long Tasks, Microsoft Researchers Find

Microsoft tested 19 AI models on complex document editing across 52 professional fields. Frontier models corrupted 25 percent of content during long sessions. Adding agentic tools made outcomes worse, not better.

AIntelligenceHubMay 11, 2026

Abstract visualization of an AI agent in a reflective dream state, reviewing glowing memory nodes and session patterns in a digital neural landscape

AI Developer Tools

Anthropic Lets Claude Agents Dream to Learn From Their Own Mistakes

Anthropic introduced dreaming to Claude Managed Agents on May 6, alongside outcomes grading and multiagent orchestration. Legal AI company Harvey saw task completion rates jump roughly 6x in early tests.

AIntelligenceHubMay 11, 2026

A lab bench with glowing molecular structures projected above glassware in a modern biotech workspace

AI Research

OpenAI Introduced GPT-Rosalind for Drug Discovery and Biology Research

OpenAI launched GPT-Rosalind, a model family built for life sciences teams that need stronger biological reasoning, tool use, and literature synthesis across multi-step research workflows.

AIntelligenceHubApril 18, 2026

A large neural network structure compressing into compact efficient modules after training inside a precise laboratory scene

AI Research

Fujitsu Open-Sourced a Toolkit to Shrink LLMs After Training

Fujitsu released One Compression, an open-source post-training quantization toolkit, to help teams reduce model size and serving cost while preserving practical quality targets.

AIntelligenceHubApril 3, 2026

Red attacker pathways and blue defensive shields colliding around an autonomous agent control node in a research setting

Large Language Models Safety

OpenClaw Security Papers Show How Agent Attacks and Defenses Are Evolving

March 2026 OpenClaw-related security research highlighted both attack paths and defense techniques for agentic systems, reinforcing that deployment safety now depends on ongoing adversarial testing.

AIntelligenceHubApril 1, 2026

Multiple AI agents feeding signals into one training loop with lightning-like optimization paths and no code rewrite imagery

Research Tools

Microsoft Agent Lightning Targets Agent Training Without Full Rewrites

Microsoft positioned Agent Lightning as a way to improve existing agents without rewriting whole stacks, a practical pitch for teams that already have automation systems in production.

AIntelligenceHubApril 1, 2026

Parallel compute lanes accelerating toward one output chip to represent faster AI inference under load

Large Language Models Research

Together AI Says Aurora Made Inference About 25% Faster in Its Tests

Together AI introduced Aurora on April 1, 2026 and said it achieved an added 1.25x speed gain over a static speculative decoding baseline by learning from live traces.

AIntelligenceHubApril 1, 2026

Abstract software engineering workspace with agent workflow nodes and code panels

Research Tools

Composer 2 Focuses on Long Coding Tasks, Not Just One-Shot Prompts

The Composer 2 technical report argues that coding agents should be trained and measured on long, tool-heavy software tasks instead of short single-turn prompt responses.

AIntelligenceHubApril 1, 2026