global attention

  • Open Sourced Loop Attention for Qwen3-0.6B


    [D] Open sourced Loop Attention for Qwen3-0.6B: two-pass global + local attention with a learnable gate (code + weights + training script)Loop Attention is an innovative approach designed to enhance small language models, specifically Qwen-style models, by implementing a two-pass attention mechanism. It first performs a global attention pass followed by a local sliding window pass, with a learnable gate that blends the two, allowing the model to adaptively focus on either global or local information. This method has shown promising results, reducing validation loss and perplexity compared to baseline models. The open-source release includes the model, attention code, and training scripts, encouraging collaboration and further experimentation. This matters because it offers a new way to improve the efficiency and accuracy of language models, potentially benefiting a wide range of applications.

    Read Full Article: Open Sourced Loop Attention for Qwen3-0.6B

  • IQuest-Coder-V1: A New Approach to Code Evolution


    IQuest-Coder-V1 Technical ReportIQuest-Coder-V1 introduces an innovative approach to training models on codebase evolution by focusing on repository commit transitions, allowing the model to learn how patches develop over time. LoopCoder modifies the traditional transformer setup by utilizing the same layer stack twice with shared weights, enabling the model to refine its understanding in a second pass rather than locking into initial outputs. This iterative process combines global attention on the first pass with local attention on the second, effectively blending insights to improve coding task performance. By training on extensive token contexts that include reasoning and agent trajectories, the model enhances its ability to identify and fix bugs in a codebase, reflecting the iterative nature of real-world coding solutions. This matters because it offers a more refined and efficient method for automated code understanding and bug fixing, aligning closely with the iterative processes used by human developers.

    Read Full Article: IQuest-Coder-V1: A New Approach to Code Evolution