AI News

Top 20 AI stories daily — research, engineering & community. Sources: Anthropic, OpenAI, DeepMind, Google Gemini, xAI, Meta AI, Mistral, arXiv, HN, Papers With Code and more. Refreshed at 7am · 1pm · 7pm ET. No influencer noise.

Now · May 2026 LIVE

What I'm doing right now.

HiveQ in production: an agentic AI platform where the agents do the work, not just chat about it
Faiston One in the works: a team of agents on call for observability and incident response
At MIT: executive Chief Digital Officer & AI Agentic program
Writing "The Agentic Executive", my next training for leaders deciding where to bet on AI

Office Hours

Office hours on AI, for builders + leaders

I keep 4 slots open each month for free 45-min calls, whether you're an engineer shipping AI features or an executive trying to figure out where to bet. Bring a real problem. I'll bring 32 years of pattern-matching plus hands-on AI work in production.

2 of 4 open · May 2026 Apply for a slot →

More headlines.

curated by AI · 3x/day

Research · arXiv

Papers worth reading.

Latest from cs.AI · cs.LG · cs.CL — what researchers are actually working on, not what VC twitter says is hot. Refreshed daily.

↗ See all on arXiv

AI Events · USA · Global · Online

Where the builders are.

Curated events for practitioners — no vendor pitches, no hype stages. Technical conferences, community meetups, and elite gatherings where real AI work gets discussed.

+ Suggest an event

I'll be there

JUN 29 – JUL 2 · 2026

AI Engineer World's Fair

San Francisco, CA · USA

In-Person Engineering

JUN 7–12 · 2026

AI Con USA

Seattle, WA + Online · USA

Hybrid Engineering

SEP 29 – OCT 1 · 2026

The AI Conference 2026

San Francisco, CA · USA

In-Person Research + Eng

DEC 6–12 · 2026

NeurIPS 2026

Sydney, Australia · Satellite: Atlanta, USA

In-Person Research

RECURRING · Tampa

NANDA Tampa · MIT Research Meetup

Tampa, FL · USA

In-Person Engineering Free

MONTHLY · Online

Office Hours with Fabio

Zoom · Global

Remote Builders + Leaders Free

Suggest an event

Know a great AI event? Tell me.

In Production · build log

What's actually shipping.

May 12

Claude-as-judge with structured rubric: 23-point precision jump on adverse-event classification. The gap between "AI feature" and "regulatory-grade tool" is the evaluation design.

May 7

CMDB reconciliation agent in production. First run flagged 4,200 shadow assets that 18 months of manual audits missed. Confidence threshold tuning is the whole game.

Apr 30

Hybrid RAG on Portuguese product catalog — vector + BM25 fusion, relevance finally converging. Under 800ms end-to-end. Bilingual edge cases were the hard part.
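
The entry above does not say which fusion method is used. One common way to combine a vector ranking with a BM25 ranking is reciprocal rank fusion (RRF), sketched below as an assumption rather than the production implementation. The function name, the document ids, and the constant k=60 (a conventional default) are all illustrative.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one list, best first.

    Each document earns 1 / (k + rank) from every list it appears in,
    so items ranked well by both retrievers rise to the top.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by id so output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers (ids are made up).
vector_hits = ["sku-42", "sku-7", "sku-13"]
bm25_hits = ["sku-7", "sku-99", "sku-42"]

fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print(fused)
```

RRF works on ranks only, so the vector and BM25 scores never need to be normalized onto a shared scale. That is one reason it is a frequent first choice for hybrid retrieval.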

Apr 22

Agentic routing layer: LLM-scored confidence bands replaced rule-based escalation logic. 94% routing accuracy. Rule systems don't age well in production.
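
A minimal sketch of what routing on confidence bands can look like, assuming the LLM returns a confidence score between 0 and 1. The band names and thresholds are hypothetical and not taken from the production system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Band:
    name: str
    min_confidence: float  # inclusive lower bound of the band


# Hypothetical bands, ordered from highest threshold to lowest.
BANDS = (
    Band("auto_resolve", 0.90),
    Band("agent_with_review", 0.60),
    Band("human_escalation", 0.0),
)


def route(confidence, bands=BANDS):
    """Return the name of the first band whose threshold the score meets."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence must be in [0, 1], got {confidence}")
    for band in bands:
        if confidence >= band.min_confidence:
            return band.name
    raise ValueError("no band matched; the last band should start at 0.0")


print(route(0.95))
print(route(0.72))
print(route(0.10))
```

Keeping the thresholds in one table means retuning is a data change, not a rewrite of branching rules.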

Apr 15

Model cost playbook locked in: Sonnet 4.6 for agentic loops, Haiku for high-volume classification, Opus for evals only. 68% token cost reduction without touching accuracy.

On X · @fabioshenrique

From the feed.

X · @fabioshenrique · live follow on X

Anthropic Intel · curated

What's shipping from Anthropic.

MAY 2026

Claude Sonnet 4.6 · Extended Thinking + Tool Use

Extended thinking now composable with tool calls in the same API request. Unlocks reasoning-while-acting patterns — agents that think before each tool step, not just at prompt time.

API Agentic Extended Thinking

APR 2026

Claude Design · Vision for Interface Agents

Anthropic's bet on computer-use and UI-aware agents is maturing fast. Claude can now reason over screenshots, DOM trees, and interaction state simultaneously.

Computer Use Vision Cowork

APR 2026

200+ MCP Connectors · Ecosystem Momentum

The MCP connector ecosystem crossed 200 production-ready integrations. Slack, Gmail, Linear, GitHub, Notion — the distribution layer for agentic work is now real infrastructure.

MCP Ecosystem Integrations

MAR 2026

Strands SDK · Agent Orchestration Framework

AWS + Anthropic released the Strands SDK for Python. Model-agnostic orchestration, built-in tool registration, streaming-first. The cleaner alternative to LangChain for AWS-native stacks.

SDK AWS Orchestration

Claude · In Production

Why I build on Claude for enterprise work.

Extended thinking produces auditable reasoning chains — critical for regulated industries (health, finance, pharma).

Tool use with structured outputs is reliable enough to run unattended. No babysitting required.

Constitutional AI means safety constraints I can actually explain to compliance teams. Not just vibes.

The context window (200K) handles full enterprise documents without chunking hacks.

Bedrock deployment keeps data in-VPC. My FSI clients need that. No compromise.

Anthropic docs →

Teaching

Courses I teach.

Claude Cowork: your own AI team, in your console.

6h · 3 sessions

Live online · individuals, teams, or corporate cohorts · no coding required

Most people use Claude like a chat window. The console is a different animal: plugins, skills, MCP connectors, persistent memory, file system access. Set it up right and you stop typing prompts and start orchestrating a small team of assistants that know your calendar, your inbox, your docs, your tools. Two cohorts so far: builders who want to move 5x faster, and execs / ops teams who want results without writing code.

Connect Claude to your calendar, email, files and SaaS in one afternoon
Build workflows that save 5+ hours a week (morning brief, meeting prep, weekly digest, expense triage)
Plugins, skills, memory, connectors: the production setup, not the demo
Privacy, scopes and access patterns for company data

Open enrollment · monthly cohorts Request →

Learning from · deeplearning.ai

Always learning.

Short courses I'm currently working through or have on the shortlist, pulled live from the DeepLearning.AI catalog.

↗ See all on DeepLearning.AI

Research · MIT Media Lab

The Internet of AI Agents.

NANDA, Networked AI Agents in Decentralized Architecture, is the MIT Media Lab initiative led by Prof. Ramesh Raskar, pioneering the open infrastructure that lets billions of specialized agents discover, communicate, negotiate, and transact autonomously across a decentralized web.

Think of it as the missing layer between today's siloed LLMs and tomorrow's agent economy: an indexing fabric, communication protocols, and economic primitives that turn AI from a chatbot product into actual infrastructure.

I'm an active participant in the NANDA research consortium, contributing across the agentic web stack, from index architecture to agent-to-agent protocols.

MIT Media Lab page → projectnanda.org → GitHub org → Events on lu.ma →

The vision

"Imagine billions of specialized AI agents collaborating across a decentralized architecture, each performing discrete functions, communicating seamlessly, navigating autonomously, socializing, learning, earning and transacting on our behalf."

NANDA, MIT Media Lab

Massachusetts Institute of Technology

Concept · Roadmap

Three phases of the agentic web.

NANDA's roadmap maps the agentic internet across three stages, each unlocking the next. Today the consortium is building Phase 1 in the open, with reference implementations and white papers.

Index

Discovery & registry

A new index architecture so agents can find each other. Unlike search engines built for humans, the agent index has to be machine-first, low-latency, and adversarial-aware.

papers · phase 1.1 + 1.2 · open-source reference impl

Protocol

Communication & teaming

Agent-to-agent protocols and adapters that let heterogeneous agents (different vendors, different models, different intent) negotiate, delegate, and form short-lived teams to solve a task.

adapters · teaming · interop

Economy

Markets & co-learning

Knowledge pricing, agent stores, and large population models: the economic layer where agents earn, pay, and learn from each other without leaking data across silos.

marketplaces · pricing · co-learning

Research pillars

Six research pillars.

The consortium's research is organized into six interlocking workstreams. Each one ships papers, reference implementations, and open standards proposals.

Foundations & infrastructure

Core index architecture and the substrate every other workstream builds on.

Agent adapters & teaming

Protocols that let agents from different stacks negotiate roles and team up to handle multi-step tasks.

Edge AI & tiny AI

Running agentic workloads at the edge (phones, sensors, embedded devices) without sacrificing capability.

Knowledge pricing

Economic primitives so agents can pay for context, expertise, and computation in real time.

Co-learning across silos

Agents that learn from each other without exposing raw data: federated, privacy-aware, composable.

Agent stores & marketplaces

The discovery and distribution layer, analogous to app stores, but built for autonomous agents and large population models.

From the NANDA community

Live from the consortium.

Recent talks YouTube →

Recent events lu.ma →

GitHub activity GitHub →
