open source

  • Explore and Compare Models with Open-Source Tool


    Built a models.dev wrapper to search/compare models + open-weight alternatives (open source)A new tool has been developed to enhance the models.dev catalog, allowing users to search, compare, and rank models efficiently while also identifying open-weight alternatives with detailed scoring explanations. This tool features fast search capabilities with on-demand catalog fetching, ensuring minimal data is sent to the client. It also provides token cost estimates and shareable specification cards, all under an open-source MIT license, encouraging community contributions for improvements. This matters because it facilitates more informed decision-making in model selection and fosters collaboration in the open-source community.

    Read Full Article: Explore and Compare Models with Open-Source Tool

  • Infinitely Scalable Recursive Model (ISRM) Overview


    ISRM: Infinitely Scalable Recursive ModelThe Infinitely Scalable Recursive Model (ISRM) is a new architecture developed as an improvement over Samsung's TRM, with the distinction of being fully open source. Although the initial model was trained quickly on a 5090 and is not recommended for use yet, it allows for personal training and execution of the ISRM. The creator utilized AI minimally, primarily for generating the website and documentation, while the core code remains largely free from AI influence. This matters because it offers a new, accessible approach to scalable model architecture, encouraging community involvement and further development.

    Read Full Article: Infinitely Scalable Recursive Model (ISRM) Overview

  • Open-source People-Matching System


    Open-source pause: what we’re actually building and where help is welcomeAn open-source project is developing a people-matching system that extends beyond dating to include connections for friendship, hobbies, projects, and more. Users are onboarded through an AI-guided interview, which gathers structured data to create embedded representations of their profiles. The challenge lies in efficiently finding the best matches among a growing user base, requiring innovative search and ranking strategies beyond simple neural networks. The project invites contributions from the open-source community to tackle this complex problem, emphasizing collaboration and open discussion over financial incentives. This matters because it leverages community-driven innovation to address a complex social networking challenge.

    Read Full Article: Open-source People-Matching System

  • Temporal LoRA: Dynamic Adapter Router for GPT-2


    [Experimental] "Temporal LoRA": A dynamic adapter router that switches context (Code vs. Lit) with 100% accuracy. Proof of concept on GPT-2.Temporal LoRA introduces a dynamic adapter router that allows models to switch between different contexts, such as coding and literature, with 100% accuracy. By training distinct LoRA adapters for different styles and implementing a "Time Mixer" network, the system can dynamically activate the appropriate adapter based on input context, maintaining model stability while allowing for flexible task switching. This approach provides a promising method for integrating Mixture of Experts (MoE) in larger models without the need for extensive retraining, enabling seamless "hot-swapping" of skills and enhancing multi-tasking capabilities. This matters because it offers a scalable solution for improving AI model adaptability and efficiency in handling diverse tasks.

    Read Full Article: Temporal LoRA: Dynamic Adapter Router for GPT-2

  • Debate Hall MCP: Multi-Agent Decision Tool


    Debate Hall mcp server - multi-agent decision making tool (open sourced. please try it out)A new multi-agent decision-making tool called Debate Hall MCP server has been developed to facilitate structured debates between three cognitive perspectives—Pathos (Wind), Ethos (Wall), and Logos (Door)—to enhance decision-making processes. This tool is based on Plato's modes of reasoning and allows AI agents to explore possibilities, ground ideas in reality, and synthesize solutions, thereby offering more nuanced solutions than single-agent approaches. The system can be configured using different AI models, such as Gemini, Codex, and Claude, and features hash chain verification, GitHub integration, and flexible modes to ensure efficient and tamper-evident debates. By open-sourcing this tool, the developer seeks feedback on its usability and effectiveness in improving decision-making. This matters because it introduces a novel way to harness AI for more comprehensive and accurate decision-making.

    Read Full Article: Debate Hall MCP: Multi-Agent Decision Tool

  • FlakeStorm: Chaos Engineering for AI Agent Testing


    [P] FlakeStorm: Chaos Engineering for AI Agent Testing (Apache 2.0, Rust-accelerated)FlakeStorm is an open-source testing engine designed to enhance AI agent testing by incorporating chaos engineering principles. It addresses the limitations of current testing methods, which often overlook non-deterministic behaviors and system-level failures, by introducing chaos injection as a primary testing strategy. The engine generates semantic mutations across various categories such as paraphrasing, noise, tone shifts, and adversarial inputs to test AI agents' robustness under adversarial and edge case conditions. FlakeStorm's architecture complements existing testing tools, offering a comprehensive approach to AI agent reliability and security, and is built with Python for compatibility, with optional Rust extensions for performance improvements. This matters because it provides a more thorough testing framework for AI agents, ensuring they perform reliably even under unpredictable conditions.

    Read Full Article: FlakeStorm: Chaos Engineering for AI Agent Testing

  • Persistent Memory for Codex CLI with Clauder


    Built an MCP server that gives Codex CLI persistent memory across sessionsClauder, an MCP server, now supports Codex CLI to provide persistent memory across sessions, addressing the issue of having to repeatedly explain codebases and architectural decisions in new Codex sessions. By storing context in a local SQLite database, Clauder automatically loads relevant information when a session starts, allowing users to store and recall facts, decisions, and conventions effortlessly. This setup, which also supports Claude Code, OpenCode, and Gemini CLI, enhances workflow efficiency by enabling cross-instance messaging for multi-terminal environments. The project is open source and MIT licensed, inviting feedback and contributions from the community. Why this matters: Persistent memory across sessions streamlines coding workflows by reducing repetitive explanations, enhancing productivity and collaboration.

    Read Full Article: Persistent Memory for Codex CLI with Clauder

  • DeepSeek’s mHC: A New Era in AI Architecture


    A deep dive in DeepSeek's mHC: They improved things everyone else thought didn’t need improvingSince the introduction of ResNet in 2015, the Residual Connection has been a fundamental component in deep learning, providing a solution to the vanishing gradient problem. However, its rigid 1:1 input-to-computation ratio limits the model's ability to dynamically balance past and new information. DeepSeek's innovation with Manifold-Constrained Hyper-Connections (mHC) addresses this by allowing models to learn connection weights, offering faster convergence and improved performance. By constraining these weights to be "Double Stochastic," mHC ensures stability and prevents exploding gradients, outperforming traditional methods and reducing training time impact. This advancement challenges long-held assumptions in AI architecture, promoting open-source collaboration for broader technological progress.

    Read Full Article: DeepSeek’s mHC: A New Era in AI Architecture

  • Open Sourced Loop Attention for Qwen3-0.6B


    [D] Open sourced Loop Attention for Qwen3-0.6B: two-pass global + local attention with a learnable gate (code + weights + training script)Loop Attention is an innovative approach designed to enhance small language models, specifically Qwen-style models, by implementing a two-pass attention mechanism. It first performs a global attention pass followed by a local sliding window pass, with a learnable gate that blends the two, allowing the model to adaptively focus on either global or local information. This method has shown promising results, reducing validation loss and perplexity compared to baseline models. The open-source release includes the model, attention code, and training scripts, encouraging collaboration and further experimentation. This matters because it offers a new way to improve the efficiency and accuracy of language models, potentially benefiting a wide range of applications.

    Read Full Article: Open Sourced Loop Attention for Qwen3-0.6B

  • LEMMA: Rust-based Neural-Guided Theorem Prover


    [P] LEMMA: A Rust-based Neural-Guided Theorem Prover with 220+ Mathematical RulesLEMMA is an open-source symbolic mathematics engine that integrates Monte Carlo Tree Search (MCTS) with a learned policy network to improve theorem proving. It addresses the shortcomings of large language models, which can produce incorrect proofs, and traditional symbolic solvers, which struggle with the complexity of rule applications. By using a small transformer network trained on synthetic derivations, LEMMA predicts productive rule applications, enhancing the efficiency of symbolic transformations across various mathematical domains like algebra, calculus, and number theory. Implemented in Rust without Python dependencies, LEMMA offers consistent search latency and recently added support for summation, product notation, and number theory primitives. This matters because it represents a significant advancement in combining symbolic computation with neural network intuition, potentially improving automated theorem proving.

    Read Full Article: LEMMA: Rust-based Neural-Guided Theorem Prover