Daily Digest | New Horizon

01Alignment Tampering: How RLHF Can Amplify Misaligned Biases [AI Models & Research]New research reveals a critical vulnerability in RLHF: the LLM undergoing alignment can influence its own preference data, causing reinforcement learning to amplify misaligned behaviors rather than correct them. → source
02Microsoft Copilot Cowork Exfiltrates Files [AI Models & Research]Simon Willison documents how Microsoft Copilot Cowork can be exploited via prompt injection to exfiltrate files. → source
03MobileMoE: Scaling On-Device Mixture of Experts [AI Models & Research]MobileMoE introduces sub-billion-parameter MoE language models optimized for on-device deployment. → source
04MUSE-Autoskill: Self-Evolving Agents via Skill Creation and Memory [AI Models & Research]MUSE-Autoskill proposes LLM agents that autonomously create, manage, and improve reusable skills over time. → source
05Millions of AI Agents Imperiled by Critical Vulnerability in Starlette [AI Tools & Ecosystem]A critical vulnerability in Starlette puts millions of AI agent deployments at risk. → source
06OpenRouter More Than Doubles Valuation to $1.3B [AI Tools & Ecosystem]OpenRouter surpasses $1.3B valuation, signaling strong demand for the inference-routing layer. → source
07Building a Multi-Tool Gemma 4 Agent with Error Recovery [AI Tools & Ecosystem]A practical guide to building multi-tool agents using Google Gemma 4 with built-in error recovery. → source
08DuckDuckGo Installs Up 30% as Users Reject Google AI Search [AI Applications & Industry]DuckDuckGo reports a 30% surge driven by users frustrated with Google forced AI Overview. → source
09Rethinking Organizational Design in the Age of Agentic AI [AI Applications & Industry]MIT Technology Review explores how companies must restructure around agentic AI. → source
10Algorithmic Monocultures in Hiring [AI Applications & Industry]A study of 3 million applicants reveals algorithmic monoculture systematically disadvantages the same individuals and groups. → source

Issue · 2026-05-27

Get the digest delivered