1. Back in Web 2.0 times openness was the default for Web-platforms, will AI agents lead us back there? A long time ago, we had Web 2.0!

  2. From the Hugging Face Hub to robot hardware with Strands Agents and LeRobot You have a robot, a folder of demonstration data on the Hugging Face Hub, and a new task you want it to learn. Today that takes five separate tools: one to record…

  3. This happened to me a week a go when I asked Claude to give me the Grand Plan to execute my future plans we are discussing about. It outright said "No, I won't give you!" and The reason is that I have so many chats going on and haven't don…

  4. 15th June 2026 - New tool, execute_write_sql , which requests user approval and then writes to a database - taking user permissions into account. #27 I added a mechanism for asking user approval in datasette agent 0.2a0.

  5. Large Language Models (LLMs) achieve strong performance on reasoning tasks, but whether this reflects faithful logical inference or heuristic approximation remains unclear. We study this question in legal entailment by comparing three para…

  6. With the advent of agents, automatic programming has become something really serious. You have now an always-on buddy ready to help, implement, and validate your implementation plans and code changes.

  7. event

    Security
    371 items

    OpenAI has released GPT-5.4-Cyber for testing as part of its Trusted Access for Cyber Defense program, aiming to compete with Anthropic's Claude Mythos in the cybersecurity domain. Meanwhile, concerns are rising over the potential risks associated with advanced AI models like Mythos, prompting calls for improved defenses before wider releases.

    model roundup

    Opus 4.7
    7 items

    Opus 4.7 has gained more frequent use among certain users, such as creative professionals and therapists, who prefer it over newer versions like 4.8 or older ones like 4.6. Some users report that Opus 4.7 handles context documents better than later versions.

  8. SpaceX will acquire AI coding tool Cursor for $60 billion in an all-stock transaction, the companies announced today. The deal is expected to close in the third quarter.

  9. Import AI 461: "Alignment is not on track"; FrontierCode; and synthetic research interns Where are your agents right now? Welcome to Import AI, a newsletter about AI research.

  10. Noema64 Noema64 is an open-source explainable chess engine that uses a language model as a persistent strategic planner while deterministic Go code owns legal move validation, game state, fallback, UCI protocol behavior, traces, and local…

  11. When large language models (LLMs) fail to generalize or make haphazard errors in reasoning, it is often taken as evidence that LLMs are not truly reasoning, but rather performing a kind of pattern matching. The implication is that people's…

  12. 12th June 2026 - Link Blog OpenAI WebRTC Audio Session, now with document context. I built the first version of this tool in December 2024 to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.

  13. If you've spent any serious time with Claude Code, you've likely already been frustrated by the AI forgetting everything. You spend the first twenty minutes of the session describing project structure, coding conventions, and why you're us…

  14. Agent systems are advancing quickly across domains, but their evaluation remains fragmented. Most benchmarks rely on fixed, LLM-centric harnesses that require heavy integration, create test-production mismatch, and limit fair comparison ac…

  15. event

    Glm
    132 items

    Recent developments in the AI space highlight significant advancements from Chinese companies, particularly Zai's upgrade of GLM-5.1, which has shown substantial improvements. Meanwhile, there are concerns about a widespread intelligence drop across various models and discussions around the potential openness of leading AI projects like GLM 5.1.

    412 items

    Anthropic's new update, Claude Mythos, has garnered attention from top AI security researchers like Carlini, who found numerous bugs. The update is noted for its speed and effectiveness, with Anthropic identifying a significant security flaw in FFmpeg and quickly submitting patches.

  16. Every major LLM provider had at least one significant outage in 2025. Anthropic, OpenAI, Gemini — all of them, at some point, just stopped responding mid-request.

  17. 11th June 2026 - Link Blog Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude (via) Big scoop for Maxwell Zeff at Wired: “We’re changing Fable 5’s safeguards for frontier LLM development to make them visibl…

  18. Hey HN, Following the OpenClaw craze, I saw a huge need for hosting personal Claws/Hermes agents on macOS [0]. So I built an agent to scrape eBay for below-market M-series Macs and built a Mac mini datacenter [1].

  19. [AINews] Open Models, Model Labs vs Agent Labs, and What's Untrainable — Sarah Guo a quiet day lets us reflect on a great essay Sarah Guo is a friend of the pod and Queen of AI, and after our Satya crossover pod (great recap here from Goku…

  20. About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features NFL Sunday Ticket © 2026 Google LLC

  21. The gravity around a black hole is so extreme that nothing, not even light, can escape once it gets close enough. Astrophysicists like Chi-kwan Chan study black holes with computer simulations and observations.