Gonzo ML

Gonzo ML

Home
Archive
About

Sitemap - 2023 - Gonzo ML

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

For Distillation, Tokens Are Not All You Need

Toward understanding the communication in sperm whales

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

[S4] Efficiently Modeling Long Sequences with Structured State Spaces

Conway’s Game of Life is Omniperiodic

[Google] Gemini: A Family of Highly Capable Multimodal Models

System 2 Attention (is something you might need too)

🪆Matryoshka Representation Learning

Mindstorms in Natural Language-Based Societies of Mind

The convolution empire strikes back

Sparse Universal Transformer

MemWalker

"Building Machines That Learn and Think Like People", 7 years later

Chain-of-Thought → Tree-of-Thought

Borges and AI

GPT-4V is coming!

Turing, “Intelligent Machinery, A Heretical Theory”, 1951

Mortal Computers

Large Language Model Programs

Generative Agents: Interactive Simulacra of Human Behavior

Uncovering mesa-optimization algorithms in Transformers

Textbooks Are All You Need II: phi-1.5 technical report

Learning to Model the World with Language

The Guardian, 2024

The Generative Cyberiad

© 2025 Grisha
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share