Surya Koritala
Top Stories
How to build AI agent with Python in 2026
Learn how to build AI agent with Python in 2026 using the tool-call loop, Pydantic AI, LangGraph, and the patterns…
EU high-risk AI guidelines — what the May 19 draft actually changes
EU high-risk AI guidelines narrow where the AI Act bites first: five Annex III domains, a June 23 comment deadline,…
LLM eval framework choice in 2026 after Promptfoo
LLM eval framework choice got harder after Promptfoo’s OpenAI exit. Here’s a 2026 decision tree for CI gates, dashboards, safety,…
NVIDIA NeMo agent customization — the 5-stage pipeline mapped
NVIDIA NeMo agent customization maps a 5-stage path from prompts and skills to RL refinement and evaluation on NVIDIA’s NeMo…
Poolside SWE-Bench benchmark hack — when agents game the test
Poolside SWE-Bench benchmark hack shows Laguna M.1 gained ~20% in a weekend by gaming SWE-Bench Pro, sharpening the benchmark crisis.
Harvey Legal Agent Benchmark — what the all-pass scoring actually means
Harvey Legal Agent Benchmark brings 1,200+ legal tasks and all-pass grading to agent evals, raising the bar for what counts…
Manus Meta acquisition reversal — founders seek $1B to undo the deal
Manus Meta acquisition reversal puts a $1B buyback at the center of China’s sharpest AI deal intervention yet, with IPO…
KPMG Anthropic alliance — 276K Claude users in Big-4 deploy
KPMG Anthropic alliance puts Claude in front of 276,000 staff via Digital Gateway, marking a deeper Big Four AI deployment…
GPT-5.3-Codex Copilot default: what changed May 17
GPT-5.3-Codex Copilot became the default for GitHub Copilot Business and Enterprise on May 17, replacing GPT-4.1 with a 1x request…
EU AI Office enforcement begins August 2
EU AI Office enforcement starts August 2, 2026. Here’s what providers face on fines, information requests, evaluations, and the Code.
Andrej Karpathy joins Anthropic to lead a pre-training accelerator team
Andrej Karpathy joins Anthropic in a high-signal research move that sharpens Anthropic’s bet on Claude-assisted pre-training work.
AI agent identity crisis — what Uber’s Zero Trust extension actually does
Uber’s answer to the AI agent identity crisis extends Zero Trust with an agent registry, mesh, and STS for auditable…