ðŸŠī Berwin Gan

            • Luminal - Search-Based Deep Learning Compilers
              • REOrdering Patches Improves Vision Models
              • FlexAttention
              • How Attention Sinks Keep Language Models Stable
              • Muon - An optimizer for hidden layers in neural networks
            • Distillation Robustifies Unlearning
            • H-Net
            • Hierarchical Reasoning Model
            • Language Models in Plato's Cave
            • Learning Compositional Models of the World
            • STP: Self-Play LLM Theorem Provers with Iterative Conjecturing and Proving
      • Aggregate Voting Rank ðŸ—ģïļ
      • Covering Discs and Orthants 📐
      • Lambda Calculus ðŸ§Ū
    Home

    âŊ

    Notes 🗒ïļ

    âŊ

    Machine Learning ðŸĪ–

    âŊ

    Research

    âŊ

    Holding

    Folder: Notes-🗒ïļ/Machine-Learning-ðŸĪ–/Research/Holding

    1 item under this folder.

    • Jun 15, 2025

      REOrdering Patches Improves Vision Models

      • vision
      • transformer

    Created with Quartz v4.4.0 ÂĐ 2025

    • GitHub
    • Discord Community