BaseMind: MIT licensed full context layer (www.reddit.com via reddit)
Hi Peeps, I'm an open-source maintainer (Goldziher on Github) and the CTO of kreuzberg.dev. I published basemind — a pure-Rust MCP server and Claude / Codex / Gemini etc.
From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot You have a robot, a folder of demonstration data on the Hugging Face Hub, and a new task you want it to learn. Today that takes five separate tools: one to record…
There's no easy way to see what your coding agents have actually installed — skills, subagents, commands, plugins, MCP servers, hooks — or which sessions are still alive vs. safe to delete.
I think it's time Claude pays the piper and adds Reddit to Websearch / Webfetch (www.reddit.com via reddit)
For some searches reddit is a very useful resource. ChatGPT has it, Gemini has it, Grok even has it.
Large Language Models (LLMs) achieve strong performance on reasoning tasks, but whether this reflects faithful logical inference or heuristic approximation remains unclear. We study this question in legal entailment by comparing three para…
Hallo Community, weiß jemand, ob es eine Möglichkeit gibt, Claude pro zu testen? Auch wenn es nur 24 Std sind?
-
376 items
event
SecurityOpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.
- 1h Getting a Use caution before running this prompt warning on simple messages?
- 2h Are AI coding agents safe? Let's say Claude Code for that matter.
- 9h Claude Fable 5: The harness matters more than the model
- 11h They're demanding Fable to somehow be 100% jailbreak-proof. It's so fucking over.
- 16h Red-teaming agents with the GOAT attack strategy
117 itemsevent
MistralMistral, a French AI company, is set to release a medium-sized model with 128 billion parameters and is planning to launch Workflows in public preview. The company, founded by Arthur Mensch, continues to grow its AI empire despite not being based in the United States.
- 1h Run Agent Skills with mistral.rs v0.8.10: /v1/skills support and more
- 17h EU leaders to meet with top AI CEOs over access to advanced AI models today
- 19h Show HN: AptSelect – A local LLM client for parallel testing and evaluation
- 23h Which AI agent spent the money on your OpenAI/Anthropic bill
- 1d Mistral AI to produce a larger family of models
datasette-agent 0.3a0 (simonwillison.net)
15th June 2026 - New tool, execute_write_sql , which requests user approval and then writes to a database - taking user permissions into account. #27 I added a mechanism for asking user approval in datasette agent 0.2a0.
- datasette-agent 0.2a0 (simonwillison.net)
- datasette-agent 0.1a4 (simonwillison.net)
- Show HN: Datasette Agent (simonwillison.net via hn)
+3 more
- datasette-agent 0.1a3 (simonwillison.net)
- datasette-agent 0.1a2 (simonwillison.net)
- datasette-agent 0.1a1 (simonwillison.net)
Ask HN: How do you find out if the LLM API is giving degraded responses (news.ycombinator.com)
If you are building on top of multiple LLM APIs or even a single one amongst OpenAI, Claude, Gemini, etc. what do you do when the API starts degrading (slow TTFT, elevated error rates, timeouts).
Agentic Resource Discovery: Let agents search (huggingface.co)
Anthropic CEO Dario Amodei joins top AI CEOs meeting with world leaders at G7 summit (www.reddit.comhttps)
Anthropic CEO Dario Amodei and OpenAI CEO Sam Altman were among tech bosses at a G7 working lunch on AI, as the US decision to restrict access to Anthropic's most advanced models causes tension among allies. Fable soon guys?
Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns (importai.substack.com)
Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns Where are your agents right now? Welcome to Import AI, a newsletter about AI research.
Agent systems are advancing quickly across domains, but their evaluation remains fragmented. Most benchmarks rely on fixed, LLM-centric harnesses that require heavy integration, create test-production mismatch, and limit fair comparison ac…
I work at a startup that makes martial arts gym software (MAAT). We handle the memberships of students so gym owners don't have to, using a payment system and a database.
-
139 items
event
GlmRecent developments in the AI space highlight significant advancements from Chinese companies, particularly Zai's upgrade of GLM-5.1, which has shown substantial improvements. Meanwhile, there are concerns about a widespread intelligence drop across various models and discussions around the potential openness of leading AI projects like GLM 5.1.
415 itemsevent
Anthropic MythosAnthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.
SpaceX to acquire AI coding platform Cursor for $60 billion (arstechnica.com)
SpaceX will acquire AI coding tool Cursor for $60 billion in an all-stock transaction, the companies announced today. The deal is expected to close in the third quarter.
OpenAI WebRTC Audio Session, now with document context (simonwillison.net)
12th June 2026 - Link Blog OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.
What’s the one word that scares every vibe coder? (www.reddit.com via reddit)
For me it’s probably CORS. Claude Code can build half the app, refactor the backend, add auth, write tests, and then somehow I still end up staring at one browser error for 40 minutes.
When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but rather performing a kind of pattern matching. The implication is that people's…
I am thinking of resubscribing to pro but is it usable ? (www.reddit.com via reddit)
I was an old subscriber, who decided to unsubscribe when Anthropic unilaterally cut limits during peak working hours. I am aware that the subscription is not usable for coding purposes, and that's ok.
[AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo a quiet day lets us reflect on a great essay Sarah Guo is a friend of the pod and Queen of AI, and after our Satya crossover pod (great recap here from Goku…
Used Claude design to create logo for my small business (www.reddit.com via reddit)
Disclaimer: I’m not an engineer I have just set up boutique financial advisory firm and want to get the basics (logo, wordmark etc in place quickly). I know exactly what I want as the logo.