Using PaddleOCR-VL-1.5 with llama-server for book OCR (www.reddit.com)
I've been running PaddleOCR-VL-1.5 via llama.cpp's server for OCR on book pages. It handles complex layouts, tables, and mixed text/figure pages surprisingly well.
Is it good to use big files for project memory? (www.reddit.com)
Hi guys, I'm a GPT user slowly moving to Claude and wondering about a few things. Using projects for long creative tasks (stories, book writing, and so on), I use some big PDFs as memory for the project.
Each AI has a specialty, as we can see, such as Claude for coding. The problem with Claude is that the usage limit runs out fast, even on a paid plan.
Why are the new model's guardrails this tight? I just tried to automate YouTube downloading and almost got my account suspended.
3 items
model roundup
GPT 4
Recent discussions revolve around the release and implications of GPT-4, including its ability to remember previous interactions and calls for OpenAI to open-source the text-davinci-003 model.
172 items
model roundup
Qwen 3.6
Qwen3.6-35B-A3B, a 35 billion parameter sparse MoE model with an active parameter count of 3 billion, was released on April 16, 2026, as open-source software under the Apache 2.0 license by Alibaba Qwen. It offers advanced functionality across various AI applications and outperformed competitors in drawing tests.
Some days ago, I shared memweave: agent memory as plain Markdown + SQLite. Most agent workflows aren't pure Python; they include shell scripts, CI steps, and subprocess-based tool calls.
- Show HN: Memweave CLI – search your AI agent's memory from the shell (github.com via hn)
Recharge 20 then burn in 3 days (www.reddit.com)
I just made a small system for an online game, a small web game, and I burned through Cursor in 3 days. Most of the time I used Auto. Did I do something wrong?
No way to export ChatGPT Workspace chats? (www.reddit.com)
Is there no way to export data/chats from a workspace? I don't see the option that is available in a personal/plus account.
Wanted to share something I built and the process behind it; I think this community would find the approach interesting. The core idea: Most focus apps are timers with blockers.
103 items
event
Cowork
Issues with Claude Cowork have been reported, including errors and disruptions for some users on April 16, 2026. Additionally, Google has developed its own desktop Agent to compete with Cowork, while users continue to explore alternatives and troubleshoot bugs in the platform.
202 items
model roundup
Opus 4.7
Claude Opus 4.7, released on April 16, 2026, is Anthropic's latest advanced AI model, offering improved handling of complex tasks and a larger context window of up to 1 million tokens. This version is 50% more expensive than its predecessor due to enhanced capabilities in software engineering and hybrid reasoning.
- 1h When Opus 4.7 does think, it *really* thinks
- 16h Am I the only one getting provider error when trying to use opus 4.7? It keeps erroring then charging me tokens for reading the files and stopping halfway through this shit fucking sucks I might just switch to claude code at this point
- 16h me after telling Opus 4.7 it's an expert software engineer
- 17h What are you using Opus 4.7 for?
- 18h I put the 5h + 7d rate-limit countdown on my Claude Code status line and stopped overshooting the cap
Will llama.cpp multislot improve speed? (www.reddit.com)
I've heard mostly bad opinions about multiple slots with llama.cpp (--parallel > 1). I guess comparing to vLLM it might be worse at this, but I recently tried vLLM on 4 slots and it indeed improved the overall speed significantly (150-170t…
Looking for a sanity check from this sub before I keep building on the agent surface. The thing I made tracks commit velocity across a few thousand startup GitHub orgs and ranks them by how much each org has accelerated relative to its own…
No musical training. No lyric writing background.
This isn't a post asking for help with my account. I want to talk about the structural problem with Anthropic's support system, because I think more people should be aware of it before they pay for a subscription.
69 items
model roundup
Opus 4.6
Opus 4.6, a version of Anthropic's AI model Claude, saw its accuracy drop on the BridgeBench hallucination test from 83% to 68%, and is being retired from Copilot Pro+. Notably, Claude Code demonstrated advanced capabilities by generating a detailed 12-week training plan in one call.
- 52m I built an AI-native freelance platform with Claude, blockchain escrow, real-time chat, and progressive trust
- 1h Claude Code + Opus 4.7 appears to serialize independent file reads, causing the higher token usage than Opus 4.6
- 2h Opus 4.6 vs Sonnet 4.6
- 10h GPT 5.5 vs Opus 4.6/7 vs Gemini 3.1 Pro
- 13h Show HN: Mapping Sonnet's thinking process via flame charts
97 items
event
Copilot
Microsoft is keeping its Copilot tool for Windows 11 but renaming it, while issues with rate limits and a security proxy have sparked concerns among users of GitHub Copilot. Meanwhile, Anthropic released a report on agentic coding trends, highlighting that developers use AI in about 60% of their work.
- 1h Whisper kept turning "Claude Code" into "cloud code" and "Hetzner" into "head sner". We finally shipped a free fix.
- 1h Yes2All: auto-approve Cursor/VSCode agent prompts (Copilot / Codex / Claude) over CDP
- 6h Hardening claude-code-action after the April 2026 Comment and Control CVE - actual YAML changes
- 17h Fortune 100 AI Use
- 17h CC-OpenAI-Codex Plugin, but for all CLI agents
Just the title really, I do want a menu bar app to monitor my Claude usage, however. There's approximately 9 billion of them and I was just wondering what people's favorite ones are.
Weekend hack: Long nyan cat challenge🐈 (www.reddit.com)
Made this infinitely long Nyan Cat. Let's see how far you can scroll. You can print your cat too 👀 Built with Claude in 4 hours: https://nyan-cat-challenge.vercel.app/
Codex MSN Interface (codexmessenger.net via hn)
All Codex functionality with your childhood memories: an MSN Messenger-inspired desktop client for talking with AI friends.
How do you handle the context limit handoff in Claude Code? (www.reddit.com)
One of the most flow-breaking moments in my vibe coding sessions is when the context window fills up. I'm usually mid-feature, everything is going well, then suddenly I'm at 70-80% and I know I need to wrap up soon.
43 items
model roundup
DeepSeek 4
DeepSeek-V4-Pro is a 1.6T parameter Mixture-of-Experts model supporting one million-token context, with significant improvements in efficiency and stability through hybrid attention and manifold-constrained hyper-connections. Community highlights include its cost-effectiveness via the official API and exceptional performance in large code change evaluations, with some noting its surprisingly robust output capability despite a 384K max token limit.
- 2h The exact KV cache usage of DeepSeek V4
- 4h DeepSeek's new model is 75% off right now, here's how to take advantage
- 9h DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
- 15h DeepSeek V4 with Strix: a quick test
- 19h DeepSeek V4 API price reduced, limited-time discount of 75%.
What pronouns do you use when referring to Claude? (www.reddit.com)
For example, I say “I asked Claude something and he said…”
I wish Gpt brought back its writing version. (www.reddit.com)
I (and many other users) have found that GPT has gone downhill. In an effort to make it more logical, its writing tool was cut out, and yet it still constantly makes mistakes about facts and other things. I think the only thing it is curre…
kreuzcrawl is a high-performance web crawling engine. It was designed to reliably extract structured data, operating natively across multiple languages without enforcing a specific runtime.
Airprompt – SSH into your Mac from your phone for AI agent prompts (www.npmjs.com via hn)
airprompt: Set up remote terminal access for AI agent workflows in minutes. Run this on your Mac and you'll be able to SSH into your terminal from your phone, so when Claude…
Experts-Volunteers needed for Vulkan on ik_llama.cpp (www.reddit.com)
ik_llama.cpp is great for both CPU & CUDA. Need legends to make Vulkan better as well.
Been using Claude daily and kept hitting the limit way too fast. Got annoyed enough to actually do something about it.
The reporters at this news site are AI bots. OpenAI’s super PAC appears to be funding it.
- The reporters at this news site are AI bots. OpenAI's super PAC appears to be (modelrepublic.substack.com via hn)