Mega Context

Tag: papers

21 items with this tag.

  • Nov 30, 2025

    Transformer-XL

    • papers
    • reference
  • Nov 30, 2025

    Sparse Transformers

    • papers
    • reference
  • Nov 30, 2025

    RoPE

    • papers
    • reference
  • Nov 30, 2025

    Slot Attention - 2006.15055v2

    • papers
    • reference
  • Nov 30, 2025

    Reformer

    • papers
    • reference
  • Nov 30, 2025

    RETRO

    • papers
    • reference
    • retrieval
    • architecture
  • Nov 30, 2025

    RAG - 2005.11401v4

    • papers
    • reference
  • Nov 30, 2025

    Perceiver IO - 2107.14795v3

    • papers
    • reference
  • Nov 30, 2025

    Neural Turing Machines

    • papers
    • reference
    • memory-architecture
    • attention-mechanisms
  • Nov 30, 2025

    Perceiver - 2103.03206v2

    • papers
    • reference
  • Nov 30, 2025

    Memorizing Transformers

    • papers
    • reference
  • Nov 30, 2025

    LongLoRA

    • papers
    • reference
  • Nov 30, 2025

    MegaTexture

    • papers
    • reference
    • placeholder
  • Nov 30, 2025

    LoRA

    • papers
    • reference
    • parameter-efficient-training
    • fine-tuning
  • Nov 30, 2025

    Knowledge Distillation

    • papers
    • reference
  • Nov 30, 2025

    LLMLingua-2 - 2403.12968v2

    • papers
    • reference
  • Nov 30, 2025

    Flash Attention

    • papers
    • reference
  • Nov 30, 2025

    Gist Tokens - 2304.08467v3

    • papers
    • reference
  • Nov 30, 2025

    Compressive Transformer

    • papers
    • reference
  • Nov 30, 2025

    DNC

    • papers
    • reference
    • memory-architecture
    • temporal-linking
  • Nov 30, 2025

    DeepSeek_OCR_paper

    • papers
    • reference

Created with Quartz v4.5.2 © 2025

  • GitHub
  • Discord Community