-
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
Paper • 2510.14972 • Published • 34 -
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 111 -
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning
Paper • 2510.19338 • Published • 114 -
The Smol Training Playbook
📚2.76kThe secrets to building world-class LLMs
Jonatan Borkowski
j14i
AI & ML interests
None yet
Recent Activity
updated
a collection
about 3 hours ago
Reading list
upvoted
a
paper
about 3 hours ago
Memory in the Age of AI Agents
reacted
to
sergiopaniego's
post
with ❤️
3 days ago
This super detailed tutorial by @Paulescu is pure gold 🪙 "Fine-tuning a Small Language Model for browser control with GRPO and OpenEnv"
LFM2-350M (@LiquidAI) + BrowserGym (OpenEnv) + GRPO (TRL) for learning browser control 🤝
https://paulabartabajo.substack.com/p/fine-tuning-lfm2-350m-for-browser