W2A – Open Protocol for Agent Perception (github.com via hn)
Agents can't act on what they can't perceive.
Show HN: Django-Modern-Rest (github.com via hn)
Hi, my name is Nikita Sobolev, I am a CPython core dev, Django Software Foundation member, and maintainer of countless Python / Django opensource tools. Now I am happy to present to you my new project.
Hi all, it feels like both OpenAI and Anthropic are more and more hyper-focused on coding and AI agents for coding. If you look at the 5.5 model changes, they mostly just talk about writing code and whatnot.
Try SearXNG, since it aggregates search results from multiple engines and it's open source. Use firecrawl / jina / fetch for reading the source.
Has Anyone vibe coding an AI Agent or Agentic AI system?! (www.reddit.com)
Hey everyone! Looking for some guidance and suggestions: has anyone worked on, or is anyone working on, building AI Agents or Agentic AI systems completely through vibecoding, especially with LangChain + LangGraph?
We’ve been testing a small phone-automation prototype. What keeps coming up isn’t whether it can click through screens.
220 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35-billion-parameter sparse MoE model with 3 billion active parameters, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
- 4m How do you objectively tell if your custom agent tools are actually better?
- 2h What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
- 2h Qwen3.6 27B on dual RTX 5060 Ti 16GB with vLLM: ~60 tok/s, 204k context working
- 4h Ran my own benchmark Qwen 3.6 35B vs Gemma 4 26B.... theres a clear winner here
- 12h 3.6 27B Tool Calling Issues (vLLM)
225 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 8m The Beautiful Lie - Teaser
- 6h Deepseek v4 pricing is genuinely silly, did the math and now i am questioning my entire stack
- 7h Crystal Sapphire Pokemon: Claude Code (Opus 4-7) vs. Codex (GPT 5.5)
- 10h Opus 4.7 is just 4.6 with a stick up its butt. Give me my tokens back!
- 10h Claude Status Update : Elevated errors on Claude Opus 4.7 on 2026-04-29T00:00:29.000Z
Curious what stacks people are actually using right now, and where you're hitting walls. Some things I've been observing while testing combos:
- Deepgram Nova-3 still the best STT for English, Cartesia is closing the gap on streaming
- Ele…
LMAO CLAUDE IS SO FUNNY (www.reddit.com)
How do AI agents improve operational efficiency in businesses? (www.reddit.com)
Curious how AI agents are actually improving day-to-day operations in businesses. Are they meaningfully reducing workload and costs, or just shifting effort into oversight and corrections?
It's been like this forever. why cursor explain!!
I decided to let two AI agents run my life. Big mistake.
Stop Winning Arguments. Start Using "Claude Mode" Instead (basila.medium.com via hn)
Avoid stressing yourself, save your relationships, and stop obsessing over irrelevant arguments by trying out Claude’s approach to discussions and disagreements.
62 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 13m Best Practices to Start with Vibe Coding? Best Local Apps for Agentic Vibe Coding?
- 7h A 3D Flappy Bird side-scroller game built with DeepSeek V4 Pro
- 14h I ran DeepSeek V4-Flash internals on 8x H100s — here’s what mHC actually does
- 17h Is paying for deepseek v4 pro worth it or are there better alternatives
- 21h DeepSeek V4 Pro: Validating Frontier Models for Production
81 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 33m So I gave claude Leetcode problem 3245.
- 3h Why Codex works better than Claude Code for my production monolith
- 3h Who's on call? How Opus 4.6 helped us calculate this 2,500x faster
- 5h Anyone else seeing Opus 4.6 (legacy) back in the Claude Desktop Code tab model picker?
- 10h Suggestions For Making Claude Less Lazy?
How did AI impact your workplace? (www.reddit.com)
Let's take a software dev team from a few years ago, to now. The devs and testers are each now given Cursor or equivalent.
Every "AI website builder" I tried produced the same template fill: "We are passionate about innovation", "Cutting-edge solutions for the modern era", three feature cards titled Innovative, Reliable, Dedicated with one sentence of fluff each…
I benchmarked caveman against the prompt "be brief" (www.reddit.com)
I’m in this weird in-between moment with AI research workflows. There are tools that can search/summarise/generate/cite sources, but the workflow still feels fragmented at best.
grokfeed: Terminal feed reader for Hacker News, Reddit, and lobste.rs. Features: unified scrollable feed from HN, Reddit subreddits, and lobste.rs; color-coded by source (HN orange, lobste.rs red, subreddits in a cycling palette); read text pos…
Agentic Engineering Management (peterszasz.com via hn)
To what extent AI is OK to use in software development might be debated, but in general, the idea is not a controversial one anymore. The debate has rather moved on from code completion and simple PR summarizations to Agentic Engineering, wher…
137 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
92 items
model roundup
GPT 5.5
On [Date], a significant leak of the OpenAI Codex model, referred to as GPT-5.5, was captured on video before it was patched. The incident involved models named Arcanine and Glacier-alpha.
- 1h GPT 5.5 passes the cup test
- 5h I stumbled on a Gemma 4 chat template bug for tools and fixed it
- 12h Quoting OpenAI Codex base_instructions
- 16h why does GPT 5.5 have a restraining order against "Raccoons," "Goblins," and "Pigeons"?
- 20h GPT-5.5 prompt for Codex tries to make it not talk about goblins
AI based Research suggestion (www.reddit.com)
Hey guys, any suggestions on which tools or methods work best in the current market for research on topics in general? I mostly do research on AI tools, agentic frameworks, what is new, what problems exist, etc.
SenseNova U1 is a new series of native multimodal models that unifies multimodal understanding, reasoning, and generation within a monolithic architecture. It marks a fundamental paradigm shift in multimodal AI: from modality integration t…
Claude Code replaced my entire workflow (www.reddit.com)
I haven't opened VS Code in three weeks. Started using Claude Code for one quick Python script in December.
demcstify: Research project for LLM-assisted reconstruction of partially decompiled Minecraft 26.1.2 sources into fully buildable, runnable, bytecode-equivalent local client and server artifacts. demcstify is not a Minecraft source distributio…
Llama.cpp MIPS R8000 Kernel Running on an SGI Power Challenge from 1995 (twitter.com via hn)
Saw a case recently where an AI coding agent ended up wiping a database in seconds. Curious how people here are handling this in real setups.