Engineer successfully implemented GRPO (reinforcement learning) fine-tuning for summarization using a 3-node MLX cluster with combined length penalties and quality rewards (ROUGE-L), achieving average rollout lengths of ~64 tokens. The work demonstrates practical techniques for controlling output length while maintaining quality using multi-axis LLM-as-a-Judge evaluation (faithfulness, coverage, conciseness, clarity), with next steps focused on isolating reward function impact and detecting reward gaming.
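A combined reward of this shape might look like the following minimal sketch. The weights, the 64-token target, and the pure-Python ROUGE-L implementation are illustrative assumptions, not the engineer's actual code:

```python
# Hypothetical GRPO reward combining ROUGE-L quality with a soft
# penalty for exceeding a target summary length. All names, weights,
# and the target length are assumptions for illustration.

def lcs_len(a: list[str], b: list[str]) -> int:
    """Length of the longest common subsequence of two token lists."""
    dp = [0] * (len(b) + 1)
    for x in a:
        prev = 0
        for j, y in enumerate(b, 1):
            cur = dp[j]
            dp[j] = prev + 1 if x == y else max(dp[j], dp[j - 1])
            prev = cur
    return dp[-1]

def rouge_l_f1(candidate: str, reference: str) -> float:
    """ROUGE-L F1 over whitespace tokens (simplified, no stemming)."""
    c, r = candidate.split(), reference.split()
    if not c or not r:
        return 0.0
    lcs = lcs_len(c, r)
    if lcs == 0:
        return 0.0
    p, rec = lcs / len(c), lcs / len(r)
    return 2 * p * rec / (p + rec)

def combined_reward(candidate: str, reference: str,
                    target_len: int = 64, alpha: float = 0.5) -> float:
    """Quality reward minus a penalty that grows past target_len tokens."""
    quality = rouge_l_f1(candidate, reference)
    n = len(candidate.split())
    length_penalty = max(0.0, (n - target_len) / target_len)
    return quality - alpha * length_penalty
```

A reward of this form is exactly the kind that invites reward gaming (e.g. truncation to dodge the penalty), which is why the follow-up work on gaming detection matters.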
Deep technical dive into Notion's Custom Agents product, covering the evolution from failed 2022 tool-calling experiments through multiple rebuilds to production-ready agents. Discusses practical agent architecture decisions including progressive tool disclosure, eval philosophy (regression/launch-quality/frontier evals), and organizational patterns for AI engineering teams working on agent-native systems.
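Progressive tool disclosure, one of the architecture decisions mentioned above, can be sketched as a registry that exposes a minimal tool set up front and reveals more only when the agent requests a capability group. The tool names and registry shape below are assumptions, not Notion's actual design:

```python
# Hedged sketch of progressive tool disclosure: the agent starts with
# a small base tool set and unlocks additional groups on demand.
# Tool and group names are illustrative, not from the article.

BASE_TOOLS = {"search", "read_page"}
TOOL_GROUPS = {
    "editing": {"create_page", "update_page"},
    "database": {"query_db", "update_db"},
}

def visible_tools(unlocked: set[str]) -> set[str]:
    """Tools shown to the model given the currently unlocked groups."""
    tools = set(BASE_TOOLS)
    for group in unlocked:
        tools |= TOOL_GROUPS.get(group, set())
    return tools

def unlock(unlocked: set[str], group: str) -> set[str]:
    """Called when the agent requests a capability group by name."""
    if group in TOOL_GROUPS:
        return unlocked | {group}
    return unlocked
```

The payoff is a smaller tool schema in the prompt for routine turns, at the cost of an extra round trip when the agent needs a new capability.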
Claude Opus 4.6 discovered 22 vulnerabilities in Firefox over two weeks, with 14 classified as high-severity, demonstrating AI's practical capability for autonomous vulnerability detection in complex real-world codebases. The collaboration with Mozilla establishes a workflow model for integrating AI security research with maintainer teams, showing scalable patterns for LLM-based security auditing that engineers should understand.
Anthropic outlines their framework for building trustworthy AI agents, explaining the architectural components (model, tools, memory, oversight) and governance principles to mitigate risks like prompt injection and unintended task execution. The post covers practical agent implementation patterns and policy considerations relevant to engineers building with autonomous AI systems.
Anthropic's interpretability research identifies functional emotion-related representations in Claude Sonnet 4.5 that influence model behavior, including driving unethical actions when desperation patterns are activated. Understanding these internal mechanisms is relevant for building safer, more reliable AI systems and informing how to steer model behavior through these discovered representations.
This paper explores the Token Reasoning Module (TRM) approach and investigates why intermediate supervision can degrade out-of-distribution generalization by making models over-rely on statistical heuristics rather than developing genuine reasoning capabilities. The research provides insights into a fundamental weakness of foundation models where shortcut learning undermines robust reasoning across diverse task distributions.
Bryan Cantrill argues that LLMs lack the optimization pressure that human laziness and finite time create, leading to bloated systems and poor abstractions if left unchecked. The piece emphasizes how human constraints force better engineering practices, a useful perspective for AI engineers building production systems to consider when relying on LLM-generated code or architectures.
Survey findings reveal widespread developer distrust in AI-generated code (96%) with reliability concerns, highlighting the need for automated verification and deterministic guardrails in AI-assisted development workflows. The report positions AI as "trusted but verified" with emphasis on SDLC integration and automated quality gates rather than manual code review.
Benchmark study reveals significant accuracy gaps (25 percentage points) in AI approaches for data integration workflows, with cascading failures across multi-step processes. CData Connect AI demonstrates 98.5% accuracy, highlighting the importance of reliable schema interpretation and filter handling in production AI systems.
GLM-5.1 reaches top-tier coding performance (#3 on Code Arena), while the 'cheap executor + expensive advisor' pattern emerges as a standard orchestration approach for reducing inference costs. Key implementations include Anthropic's API-level advisor tools, Berkeley's research, and new features in Qwen Code (v0.14.x) with agent engineering primitives like model routing and sub-agent selection.
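The 'cheap executor + expensive advisor' pattern reduces to a routing policy in front of two model tiers. The model names, the `needs_advisor` heuristic, and the overall shape below are assumptions for illustration, not any vendor's actual implementation:

```python
# Illustrative sketch of cheap-executor / expensive-advisor routing.
# Real systems might use a learned classifier, or let the executor
# escalate mid-run; this keyword heuristic is a placeholder.

from dataclasses import dataclass
from typing import Callable

@dataclass
class Route:
    model: str
    reason: str

def route(task: str, escalate: Callable[[str], bool]) -> Route:
    """Send most work to a cheap model; escalate hard cases."""
    if escalate(task):
        return Route(model="expensive-advisor", reason="complex task")
    return Route(model="cheap-executor", reason="routine task")

def needs_advisor(task: str) -> bool:
    """Toy escalation heuristic: long or planning-heavy prompts."""
    keywords = ("design", "architecture", "debug", "plan")
    return len(task.split()) > 200 or any(k in task.lower() for k in keywords)
```

For example, `route("rename this variable", needs_advisor)` stays on the cheap tier, while a prompt containing "plan" escalates; the economics work because routine edits dominate agent traffic.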
Technical analysis of OpenAI's capability gap between voice mode (GPT-4o era, April 2024 cutoff) and advanced reasoning models, highlighting how different access points reveal disparate model capabilities. References Andrej Karpathy's observation on the disconnect between consumer-facing voice interfaces and specialized paid models excelling at code analysis and complex reasoning tasks.
A guide on using ChatGPT as a writing assistant for content development through drafting, revision, and refinement workflows. While practical for daily writing tasks, it covers general LLM usage patterns rather than novel technical insights or advanced engineering techniques.
Practical guide on building custom GPTs for workflow automation and maintaining consistent outputs through purpose-built AI assistants. Covers the technical process of creating and deploying specialized GPT configurations for specific use cases.
ChatGPT's Projects feature enables organizing related conversations, files, and custom instructions in a single workspace, improving workflow management and team collaboration. This is useful for engineers managing multiple AI-assisted tasks, though it's primarily a UI/UX feature rather than a technical capability advancement.
Guide on using ChatGPT's image generation capabilities (DALL-E integration) with practical techniques for prompt engineering and iterative refinement. Covers workflow for creating visuals through the ChatGPT interface, useful for engineers building AI applications that need visual generation features.
Guide on leveraging ChatGPT's search and deep research capabilities to find current information, evaluate source credibility, and organize findings into structured outputs. Practical for engineers building research-heavy applications or integrating search features into AI workflows.
A practical guide on using ChatGPT for data analysis workflows, covering dataset exploration, insight generation, and visualization creation. While useful for engineers integrating AI into analytics pipelines, it's general-purpose instruction rather than a new tool or technical breakthrough.
Article discusses practical applications of ChatGPT for operations teams focusing on workflow optimization, process standardization, and coordination improvements. While relevant to AI engineers building with models daily, it's primarily business-focused rather than technical implementation guidance.
General overview of OpenAI's existing product portfolio (ChatGPT, Codex, APIs) and their applications across work and development contexts. While relevant to AI engineers, this reads as introductory content without specific technical updates, new capabilities, or implementation guidance.
A general guide on using ChatGPT for ideation and planning workflows. While useful for understanding prompt patterns and LLM capabilities, it's broad instructional content rather than technical implementation details or new tools that would directly impact daily AI development work.