1. I've been running PaddleOCR-VL-1.5 via llama.cpp's server for OCR on book pages. It handles complex layouts, tables, and mixed text/figure pages surprisingly well.

  2. Hi guys, I’m a gpt user slowly approaching to Claude and wondering few things. Using projects for long creative tasks (stories, book writing, and so on), I use some big pdf as memory for the project.

  3. Each AI has a specialty we see, like Claude for its coding for example. Problem with Claude is the usage limit runs out fast even when paid.

  4. Why is the new model guardrails this tight? i just tried to automate youtube downloading and almost got my account suspended.

  5. model roundup

    Qwen 3.6
    172 items

    Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.

  6. Some days ago, I shared memweave agent memory as plain Markdown + SQLite. Most agent workflows aren't pure Python — shell scripts, CI steps, subprocess-based tool calls.

  7. Just made a small system for online game, small web game, and I burn cursor in 3 days. And most of time use auto , did I do something wrong ?

  8. Is there no way to export data/chats from a workspace? I don't see the option that is available in a personal/plus account.

  9. Wanted to share something I built and the process behind it, think this community would find the approach interesting. The core idea Most focus apps are timers with blockers.

  10. event

    Cowork
    103 items

    Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.

    model roundup

    Opus 4.7
    202 items

    Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.

  11. I've heard mostly bad opinions about multiple slots with llama.cpp (--parallel > 1). I guess comparing to vLLM it might be worse at this, but I recently tried vLLM on 4 slots and it indeed improved the overall speed significantly (150-170t…

  12. Looking for a sanity check from this sub before I keep building on the agent surface. The thing I made tracks commit velocity across a few thousand startup GitHub orgs and ranks them by how much each org has accelerated relative to its own…

  13. No musical training. No lyric writing background.

  14. This isn't a post asking for help with my account. I want to talk about the structural problem with Anthropic's support system, because I think more people should be aware of it before they pay for a subscription.

  15. model roundup

    Opus 4.6
    69 items

    Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.

    event

    Copilot
    97 items

    Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.

  16. Just the title really, I do want a menu bar app to monitor my Claude usage, however. There's approximately 9 billion of them and I was just wondering what people's favorite ones are.

  17. Made this infinitely long Nyan Cat Let's see how far you can scroll You can print your cat too 👀 Built with Claude in 4 hours https://nyan-cat-challenge.vercel.app/

  18. All Codex functionality with your childhood memories: an MSN Messenger-inspired desktop client for talking with AI friends.

  19. One of the most flow-breaking moments in my vibe coding sessions is when the context window fills up. I'm usually mid-feature, everything is going well, then suddenly I'm at 70-80% and I know I need to wrap up soon.

  20. model roundup

    DeepSeek 4
    43 items

    DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.

  21. For example; I say “I asked Claude something and he said…”

  22. I (and many other users) have found that gpt has gone downhill, in an effort to make it more logical, its writing tool was cut out, and yet it still constantly makes mistakes about facts and other things. I think the only thing it is curre…

  23. kreuzcrawl is a high-performance web crawling engine. It was designed to reliably extract structured data, operating natively across multiple languages without enforcing a specific runtime.

  24. Set up remote terminal access for AI agent workflows in minutes airprompt Set up remote terminal access for AI agent workflows in minutes. Run this on your Mac and you'll be able to SSH into your terminal from your phone — so when Claude…

  25. ik_llama.cpp is great for both CPU & CUDA. Need legends to make Vulkan better as well.

  26. Been using Claude daily and kept hitting the limit way too fast. Got annoyed enough to actually do something about it.

  27. Article Conversation The reporters at this news site are AI bots. OpenAI’s super PAC appears to be funding it.