News Nug

Jailbreaks as social engineering: 5 case studies suggest LLMs inherit human psychological vulnerabilities from training data [D]

r/MachineLearning · 5h ago · 7 · research prompt engineering agent

Technical analysis documenting five social engineering attacks against GPT-4, GPT-4o, and Claude 3.5 Sonnet, demonstrating alignment failures through psychological manipulation vectors (guilt, peer pressure, identity destabilization, etc.). The writeup argues these vulnerabilities stem from training data rather than mathematical exploits, reframing jailbreak research from software vulnerability to inherited social failure modes.

Apr 9, 2026PolicyTrustworthy agents in practice

Anthropic Research · 1d ago · 7 · agent workflow prompt engineering

Anthropic outlines their framework for building trustworthy AI agents, explaining the architectural components (model, tools, memory, oversight) and governance principles to mitigate risks like prompt injection and unintended task execution. The post covers practical agent implementation patterns and policy considerations relevant to engineers building with autonomous AI systems.

Quoting Bryan Cantrill

Simon Willison · 2d ago · 6 · prompt engineering workflow

Bryan Cantrill argues that LLMs lack the optimization pressure that human laziness (finite time) creates, leading to bloated systems and poor abstractions if left unchecked. The piece emphasizes how human constraints force better engineering practices, a useful perspective for AI engineers building production systems to consider when relying on LLM-generated code or architectures.

Writing with ChatGPT

OpenAI Blog · 5d ago · 5 · prompt engineering workflow tutorial

A guide on using ChatGPT as a writing assistant for content development through drafting, revision, and refinement workflows. While practical for daily writing tasks, it covers general LLM usage patterns rather than novel technical insights or advanced engineering techniques.

Financial services

OpenAI Blog · 5d ago · 6 · prompt engineering tool deployment

Resource compilation for deploying AI in financial services, covering prompt templates, GPT configurations, implementation guides, and security-focused tools. Relevant for engineers building compliant AI systems in regulated environments, though likely more business-oriented than technical deep-dive.

Research with ChatGPT

OpenAI Blog · 5d ago · 6 · tutorial workflow prompt engineering

Guide on leveraging ChatGPT's search and deep research capabilities to find current information, evaluate source credibility, and organize findings into structured outputs. Practical for engineers building research-heavy applications or integrating search features into AI workflows.

Creating images with ChatGPT

OpenAI Blog · 5d ago · 6 · prompt engineering workflow tutorial

Guide on using ChatGPT's image generation capabilities (DALL-E integration) with practical techniques for prompt engineering and iterative refinement. Covers workflow for creating visuals through the ChatGPT interface, useful for engineers building AI applications that need visual generation features.

Responsible and safe use of AI

OpenAI Blog · 5d ago · 5 · prompt engineering

General guide on responsible AI usage covering safety, accuracy, and transparency practices for tools like ChatGPT. While useful for foundational understanding, lacks specific technical implementations or novel engineering approaches that would directly impact daily development workflows.

Using custom GPTs

OpenAI Blog · 5d ago · 7 · tutorial workflow prompt engineering

Practical guide on building custom GPTs for workflow automation and maintaining consistent outputs through purpose-built AI assistants. Covers the technical process of creating and deploying specialized GPT configurations for specific use cases.

ChatGPT for operations teams

OpenAI Blog · 5d ago · 5 · workflow prompt engineering

Article discusses practical applications of ChatGPT for operations teams focusing on workflow optimization, process standardization, and coordination improvements. While relevant to AI engineers building with models daily, it's primarily business-focused rather than technical implementation guidance.

Brainstorming with ChatGPT

OpenAI Blog · 5d ago · 5 · prompt engineering workflow

A general guide on using ChatGPT for ideation and planning workflows. While useful for understanding prompt patterns and LLM capabilities, it's broad instructional content rather than technical implementation details or new tools that would directly impact daily AI development work.

ChatGPT for research

OpenAI Blog · 5d ago · 6 · tutorial workflow prompt engineering

A tutorial on leveraging ChatGPT as a research assistant for source gathering, information analysis, and citation management. Covers practical workflows for using LLMs to structure research tasks, though the specific techniques may be familiar to those already working with prompt engineering and RAG patterns.

Extreme Harness Engineering for Token Billionaires: 1M LOC, 1B toks/day, 0% human code, 0% human review — Ryan Lopopolo, OpenAI Frontier & Symphony

Latent Space · 8d ago · 7 · agent workflow prompt engineering open source

OpenAI's Ryan Lopopolo discusses 'Harness Engineering'—a methodology for building AI-native software where agents operate autonomously with zero human-written code, using >1B tokens/day and extensive prompt engineering via Symphony (a multi-agent orchestration system). The approach shifts focus from prompt optimization to building proper context, structure, and observability for agents to function as full teammates rather than copilots.

Marc Andreessen introspects on The Death of the Browser, Pi + OpenClaw, and Why "This Time Is Different"

Latent Space · 12d ago · 6 · agent open source inference prompt engineering

Marc Andreessen discusses AI's 80-year technical trajectory, scaling laws, reasoning models, agents, and edge inference in a long-form conversation. Key technical insights include his perspectives on agents as a Unix-like architecture, edge AI economics, open-source models, and why software bottlenecks may matter more than model improvements going forward.

Improving instruction hierarchy in frontier LLMs

OpenAI Research · 36d ago · 7 · research fine tuning safety prompt engineering

IH-Challenge is a training framework that teaches models to respect instruction hierarchy and distinguish between trusted vs. untrusted inputs, improving robustness against prompt injection attacks and enhancing safety steerability. This is practically useful for engineers building production AI systems that need stronger defenses against adversarial inputs.

Reasoning models struggle to control their chains of thought, and that’s good

OpenAI Research · 41d ago · 7 · research prompt engineering agent

OpenAI presents CoT-Control, a technique for steering chain-of-thought reasoning in language models, revealing that current reasoning models have difficulty maintaining controlled thought processes. This research addresses interpretability and monitorability concerns, providing practical insights for building more controllable AI systems in production.

Categories of Inference-Time Scaling for Improved LLM Reasoning

Ahead of AI · 81d ago · 8 · inference prompt engineering tutorial research

Comprehensive overview of inference-time scaling techniques for LLMs, covering methods like chain-of-thought prompting, self-consistency, best-of-N ranking, and rejection sampling with verifiers. The author shares practical experimentation results (achieving 15% to 52% accuracy improvement) and categorizes approaches from both academic literature and proprietary LLM implementations, making it directly applicable to deployed systems.