Archive - m0nads

Reasoning without External Rewards

From self-certainty to Reinforcement Learning from Internal Feedback

Jul 20, 2025 • m0nads

LLMs: a 5 mins trip

What Large Language Models actually do in mathematical terms

Jul 15, 2025 • m0nads

The Kullback-Leibler divergence

A fundamental comparison tool

Jul 3, 2025 • m0nads

February 2025

Group Relative Policy Optimization

An efficient and effective reinforcement learning algorithm

Feb 25, 2025 • m0nads

January 2025

Minimal RAG model

Using Cohere and SerpAPI

Jan 26, 2025 • m0nads

Byte Latent Transformer

The Efficiency of Dynamic Byte Patching

Jan 14, 2025 • m0nads

August 2024

Exploring Florence-2

Multimodal AI with Unified Vision-Language Capabilities

Aug 12, 2024 • m0nads

June 2024

Covariance and Correlation

Statistical quantities for variables relationships

Jun 17, 2024 • m0nads

May 2024

Tiny but mighty

May 29, 2024 • m0nads

Local RAG using LLaMA3

A quick Retrieval-Augmented Generation model

May 24, 2024 • m0nads

"A tiny open-source computer-vision model that runs everywhere and kicks ass" (cit.)

May 24, 2024 • m0nads

April 2024

LLM training can be much cheaper than people generally thought

Apr 22, 2024 • m0nads

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts