    Repositories list

    • Storing long contexts in tiny caches with self-study
      Python
      710540 Updated Jul 21, 2025
    • Tile primitives for speedy kernels
      Cuda
      0600 Updated Jul 19, 2025
    • minions

      Public
      Big & Small LLMs working together
      Python
      1211.1k111 Updated Jul 18, 2025
    • bwler

      Public
      Official repo for BWLer: Barycentric Weight Layer
      Python
      21700 Updated Jul 17, 2025
    • Tile primitives for speedy kernels
      Cuda
      1602.5k4013 Updated Jul 15, 2025
    • zoology

      Public
      Understand and test language model architectures on synthetic tasks.
      Python
      3722121 Updated Jul 13, 2025
    • Python
      01900 Updated Jun 24, 2025
    • based

      Public
      Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
      Python
      1723730 Updated Jun 6, 2025
    • kernels, of the mega variety
      Python
      2245331 Updated Jun 2, 2025
    • hyena-dna

      Public
      Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
      Assembly
      100704327 Updated Apr 22, 2025
    • Python
      5700 Updated Mar 18, 2025
    • lolcats

      Public
      Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
      Python
      2524480 Updated Jan 31, 2025
    • aioli

      Public
      Aioli: A unified optimization framework for language model data mixing
      Jupyter Notebook
      42710 Updated Jan 17, 2025
    • FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
      C++
      29323184 Updated Dec 28, 2024
    • m2

      Public
      Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
      Assembly
      42555252 Updated Dec 28, 2024
    • meerkat

      Public
      Creative interactive views of any dataset.
      Python
      4484383 Updated Dec 24, 2024
    • smoothie

      Public
      Jupyter Notebook
      3900 Updated Dec 10, 2024
    • train-tk

      Public
      train with kittens!
      Python
      7.2k6100 Updated Oct 25, 2024
    • WONDERBREAD benchmark + dataset for BPM tasks
      Jupyter Notebook
      62610 Updated Oct 20, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      8.8k100 Updated Oct 14, 2024
    • Automating enterprise workflows with multimodal agents
      Jupyter Notebook
      1310800 Updated Oct 9, 2024
    • An open science effort to benchmark legal reasoning in foundation models
      Python
      6845796 Updated Aug 25, 2024
    • hgcn

      Public
      Hyperbolic Graph Convolutional Networks in PyTorch.
      Python
      115631203 Updated Jul 25, 2024
    • manifest

      Public
      Prompt programming with FMs.
      Python
      4544362 Updated Jul 22, 2024
    • Python
      25600 Updated Jul 9, 2024
    • safari

      Public
      Convolutions for Sequence Modeling
      Assembly
      71893251 Updated Jun 13, 2024
    • A framework for few-shot evaluation of language models.
      Python
      2.6k1000 Updated Jun 8, 2024
    • A framework for few-shot evaluation of language models.
      Python
      2.6k800 Updated Jun 3, 2024
    • axolive

      Public
      Go ahead and axolotl questions
      Python
      1.1k200 Updated Jun 3, 2024
    • Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
      Jupyter Notebook
      2.6k100 Updated Jun 3, 2024