MosaicLeaks: Can your research agent keep a secret? (huggingface.co)
TL;DR: Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive inf…
Disclosed upfront: I run [Tickerr dot ai], an independent external monitor for AI APIs. Today it tracks latency, TTFT, uptime, and error rates across major models.
A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approximating the behavior of components of deep…
built a factchecker that catches politicians lying in real time (www.reddit.com)
hi everyone ! built this as part of a larger NLP / deception research project at my university, wanted to share in case anyone finds it useful!
Improving health intelligence in ChatGPT (openai.com)
140 items
event
GLM
Recent developments in the AI space highlight significant advancements from Chinese companies, particularly Zai's upgrade of GLM-5.1, which has shown substantial improvements. Meanwhile, there are concerns about a widespread intelligence drop across various models and discussions around the potential openness of leading AI projects like GLM 5.1.
- 12h GLM 5.2 is now available via a unified Model API
- 12h An open-source AI just beat OpenAI's GPT-5.5 at coding (1/6th the price)
- 12h GLM 5.2 playing text adventures
- 19h GLM-5.2 is probably the most powerful text-only open weights LLM
- 19h GLM 5.2 via Claude Code is the first non-Claude model that feels close to Opus
421 items
event
Anthropic Mythos
Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
- 52m Anthropic confident of re-enabling Mythos, Fable 5 access 'in coming days'
- 5h We Got Anthropic's Glasswing at Home (Who Needs Mythos 5 or Fable 5?)
- 5h Ask HN: Could Fable/Mythos be used to build Python's JIT?
- 6h Before Mythos, Satan: A 1990s Software Satanic Panic
- 6h The Korean Telecom Giant at the Center of Anthropic's Mythos Controversy
How do you find an MCP? (www.reddit.com via reddit)
Where do you currently find MCP servers? How do you discover new ones?
Choji: Agents to take a customer ask to pull request autonomously (twitter.com via hn)
Excited to open up the beta for Choji (https://t.co/7b0K7jM5LI) Choji is a platform of agents that takes an ask and turns it into a ready-to-merge PR fully autonomously so you can focus on bigger stuff. Watch us use Choji to ship new site…
Securing the future of AI agents (deepmind.google)
- Future of Work with AI Agents (futureofwork.saltlab.stanford.edu via hn)
last week i was "working" on 5 things and i was completely overwhelmed to the point that i literally didn't know what to do. i did use ai chatting apps (ChatGPT, Claude, Gemini, etc..) and explained to them what i have in my mind but they…
Large Language Models (LLMs) achieve strong performance on reasoning tasks, but whether this reflects faithful logical inference or heuristic approximation remains unclear. We study this question in legal entailment by comparing three para…
378 items
event
Security
OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
173 items
model roundup
Opus 4.8
Claude AI has released Opus 4.8, an upgrade to their Opus class of models available in version 2.1.154 of their software on March 16, 2023, which includes enhanced coding and professional task capabilities along with improved judgment and honesty. Users are reporting usage resets following the update.
New in Claude Code: Artifacts (www.reddit.com)
Interactive pages built from your session, like a PR walkthrough or a living project dashboard, shared with your team at a private link. Available in beta on Team and Enterprise plans.
- Claude Code now supports artifacts (claude.com via hn)
- Scrolling up in Claude Code? (www.reddit.com via reddit)
- No more 1M context in Claude Code? (www.reddit.com via reddit)
- Claude Code Is a Chainsaw (mechanicalsurvival.com via hn)
- Claude Code Is Dead (claude-code-is-dead.vercel.app via hn)
- Claude Code Reset! (www.reddit.com)
- My Claude Code Setup (illuminatedcomputing.com via hn)
- Learn Claude Code (www.reddit.com)
- Fable 5 Claude code (www.reddit.com via reddit)
- Claude Code Endgame (www.reddit.com)
- Cursor vs Claude Code (www.reddit.com via reddit)
- Claude Code vs. Codex (news.ycombinator.com)
- Claude Code is not about code anymore (blog.vtemian.com via hn)
- Claude Code from the Beach (rogs.me via hn)
- Strava for Claude Code (straude.com via hn)
- Claude Code Ultracode (note.com via hn)
- Some obsidian + Claude code (www.reddit.com)
- Claude Design and Code (www.reddit.com)
- Trading with claude code (www.reddit.com)
- I Tried Claude Code (zhenyi.gibber.blog via hn)
- Claude code and Cowork artifacts on iPhone (www.reddit.com)
- Claude Code in a Loop (github.com via hn)
- Claude design to code (www.reddit.com)
- Claude Code Update (www.reddit.com)
- Claude Code Rules (www.reddit.com)
- Claude Code in the Browser (chromewebstore.google.com via hn)
- Claude Code + Notion AI (www.reddit.com)
- /goal for Claude Code (www.reddit.com)
- /goal in claude code (www.reddit.com)
- Claude code vs Codex (www.reddit.com)
- Claude Code Sandboxing (code.claude.com via hn)
- Setting up Claude code (www.reddit.com)
- Telemetry for Claude Code (latitude.so via hn)
- Claude code (www.reddit.com)
- Where should I start with Claude Code? (www.reddit.com)
- Errors in Claude Code (www.reddit.com)
- Claude Code App? (www.reddit.com)
- What Happened to Claude Code (man-labs.com via hn)
- Multiplayer Claude Code (www.reddit.com)
- How to start with Claude Code (www.reddit.com)
- Claude Code Manager (www.reddit.com)
- Does Claude Code Hate UI's? (www.reddit.com)
- A Harness for Claude Code (euleptos.com via hn)
- Claude Code Hackathon! (www.reddit.com)
- Aider and Claude Code (www.reddit.com)
- Qwen 3.6 for Claude Code in 1L (www.reddit.com)
- Claude Code Manager (www.reddit.com)
- How I feel when I Claude Code.. (www.reddit.com)
- Claude Code Routines (code.claude.com via hn)
- Routines in Claude Code (claude.com via hn)
- HOW TO USE CLAUDE CODE (www.reddit.com)
- Claude Code + Obsidian? (www.reddit.com)
- Claude Changes My Code (alexcbecker.net via hn)
Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns (importai.substack.com)
Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns Where are your agents right now? Welcome to Import AI, a newsletter about AI research.
Is it agentic enough? Benchmarking open models on your own tooling (huggingface.co)
When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but rather performing a kind of pattern matching. The implication is that people's…
From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot You have a robot, a folder of demonstration data on the Hugging Face Hub, and a new task you want it to learn. Today that takes five separate tools: one to record…
421 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
105 items
event
Fine Tuning
Fine-tuning is a hot topic in the AI community, with various projects and releases focusing on it. Notable examples include OpenAI's decision to wind down its fine-tuning API, Anthropic co-founder Jack Clark's prediction that AI research could become automated by 2028, and several new datasets and models released for fine-tuning purposes.
- 19h Beyond LoRA: Can you beat the most popular fine-tuning technique?
- 1d The Guide to Fine-Tuning LLMs
- 1d Could we use latent representations as internal safety checks during generation?
- 2d Show HN: Does a vibe leak? Fine-tuning an LLM on an attitude it never states
- 7d Parallelogram – catch fine-tuning dataset bugs before training
The internet is on life support (news.ycombinator.com)
I am wondering what is going to happen to the internet when you can no longer navigate it with a Google Search? It's just astounding what OpenAI and Anthropic have accomplished, and I reminisce.
datasette-agent 0.3a0 (simonwillison.net)
15th June 2026 - New tool, execute_write_sql, which requests user approval and then writes to a database, taking user permissions into account. #27 I added a mechanism for asking user approval in datasette agent 0.2a0.
- datasette-agent 0.2a0 (simonwillison.net)
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
- datasette-agent 0.1a3 (simonwillison.net)
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
A PostgreSQL Database for Every Agent: In-Database RAG, Graph, and Multitenancy (www.yugabyte.com via hn)
- Why our AI agent needed a causal graph, not just a RAG database (openyf.dev via hn)
Agent systems are advancing quickly across domains, but their evaluation remains fragmented. Most benchmarks rely on fixed, LLM-centric harnesses that require heavy integration, create test-production mismatch, and limit fair comparison ac…
Agentic Resource Discovery: Let agents search (huggingface.co)