event

Function Calling

12 items · started 2023-06-13 · ongoing (last activity 2026-05-03)

Local LLM Benchmark about Backend Generation by Function Calling (GLM vs Qwen vs DeepSeek) (www.reddit.com)

+1 17h function-calling glm deepseek+3

Detailed Article: https://autobe.dev/articles/local-llm-benchmark-about-backend-generation.html Five months ago I posted the "Hardcore function calling benchmark in backend coding agent" thread here. As I wrote in that post, it was an unco…
Qwen Meetup Draft Review Required (Function Calling Harness 2 - CoT Compliance from 9.91% to 100%) (autobe.dev via reddit)

+1 1d function-calling qwen

Talk at Qwen Meetup Korea end of May. Looking for review on this draft before I build PPT slides off it.
Multi-agent in production: real win or just hype? (www.reddit.com)

+12 4d function-calling agentic

Trying to get an honest read on this from people actually shipping. Every other AI announcement lately is "agentic" or "multi-agent," and I can't always tell if it's a real architectural shift or rebranded function calling with extra steps.
Learn, run and test Agentic AI on your browser for free! (Built with Claude Opus 4.7 in 2 days) (www.reddit.com)

+48 4d function-calling fine-tuning rag+4

Hey Everyone, Over the last few months, I noticed a massive gap in how we learn about Agentic AI. There are a million theoretical blog posts and dense whitepapers on RAG, tool calling, and swarms, but almost nowhere to just sit down, run a…
- Run, Learn and test Agentic AI for free, on your browser! (Open AI Models are included) (www.reddit.com)
Show HN: I built a search engine for llms.txt sites (statespace.com via hn)

+2 5d function-calling vector-database mistral+1

More and more developer tools are adopting the llms.txt standard to build AI-friendly versions of their docs. The problem is that it's very hard to search across them.
Qwen 3.6 27B BF16 vs Q4_K_M vs Q8_0 GGUF evaluation (www.reddit.com)

+9035 5d humaneval function-calling llama+1

Evaluated Qwen 3.6 27B across BF16, Q4_K_M, and Q8_0 GGUF quant variants with llama-cpp-python using Neo AI Engineer. Benchmarks used: HumanEval (code generation), HellaSwag (commonsense reasoning), BFCL (function calling). Total samples: HumanE…
The model alone is not the agent. The harness plus the model is the agent (www.reddit.com)

+16 11d function-calling agentic
Context checkpoint erasure in llama.cpp ? (www.reddit.com)

+37 2w function-calling llama qwen

Has anyone been able to solve or mitigate context checkpoints being erased during single user inference, specifically when function calling is part of the chat history? I've been using Qwen 3.5 35B A3B for some time (now using 3.6), tested…
Benchmarked Gemma 4 E2B: The 2B model beat every larger sibling on multi-turn (70%) (aiexplr.com via reddit)

+2612 2w function-calling prompt-injection security+2

Tested Gemma 4 E2B across 10 enterprise task suites against Gemma 2 2B, Gemma 3 4B, Gemma 4 E4B, and Gemma 3 12B. Run locally on Apple Silicon.
Defender – Local prompt injection detection for AI agents (no API calls) (www.npmjs.com via hn)

+1 2w tool-calling function-calling prompt-injection+2

Prompt injection defense framework for AI tool-calling. Indirect prompt injection defense and protection for AI agents using tool calls (via MCP, CLI or direct function calling). Detects and neutralizes prompt injection attacks hidden in t…
Function calling and other API updates (openai.com)

150w function-calling
