Preview: Tweaked Geek: Practical AI Tech

MiniMax M2.1: Open Source SOTA for Dev & Agents

MiniMax M2.1, now open source and available on Hugging Face, is setting new standards in real-world development and agent applications by achieving state-of-the-art (SOTA) performance on coding benchmarks such as SWE, VIBE, and Multi-SWE. Demonstrating superior capabilities, it surpasses notable models like Gemini 3 Pro and Claude Sonnet 4.5. With a configuration of 10 billion active parameters and a total of 230 billion parameters in a Mixture of Experts (MoE) architecture, MiniMax M2.1 offers significant advancements in computational efficiency and effectiveness for developers and AI agents. This matters because it provides the AI community with a powerful, open-source tool that enhances coding efficiency and innovation in AI applications.

Read Full Article

Posted on

Dec 26, 2025

by

Neural Nix

in

Deep Dives, Tools

Topics: AI advancements, AI tools, open source

NVIDIA’s New 72GB VRAM Graphics Card

NVIDIA has introduced a new 72GB VRAM version of its graphics card, providing a middle ground for users who find the 96GB version too costly and the 48GB version insufficient for their needs. This development is particularly significant for the AI community, where the demand for high-capacity VRAM is critical for handling large datasets and complex models efficiently. The introduction of a 72GB option offers a more affordable yet powerful solution, catering to a broader range of users who require substantial computational resources for AI and machine learning applications. This matters because it enhances accessibility to high-performance computing, enabling more innovation and progress in AI research and development.

Posted on

by

in

Topics: machine learning, AI models, Nvidia

Top Local LLMs of 2025

The year 2025 has been remarkable for open and local AI enthusiasts, with significant advancements in local language models (LLMs) like Minimax M2.1 and GLM4.7, which are now approaching the performance of proprietary models. Enthusiasts are encouraged to share their favorite models and detailed experiences, including their setups, usage nature, and tools, to help evaluate these models' capabilities given the challenges of benchmarks and stochasticity. The discussion is organized by application categories such as general use, coding, creative writing, and specialties, with a focus on open-weight models. Participants are also advised to classify their recommendations based on model memory footprint, as using multiple models for different tasks is beneficial. This matters because it highlights the progress and potential of open-source LLMs, fostering a community-driven approach to AI development and application.

Read Full Article

Posted on

Dec 26, 2025

by

Neural Nix

in

Commentary, Deep Dives, Tools

Topics: AI advancements, AI models, AI tools

Google’s FunctionGemma: AI for Edge Function Calling

Google has introduced FunctionGemma, a specialized version of the Gemma 3 270M model, designed specifically for function calling and optimized for edge workloads. FunctionGemma retains the Gemma 3 architecture but focuses on translating natural language into executable API actions rather than general chat. It uses a structured conversation format with control tokens to manage tool definitions and function calls, ensuring reliable tool use in production. The model, trained on 6 trillion tokens, supports a 256K vocabulary optimized for JSON and multilingual text, enhancing token efficiency. FunctionGemma's primary deployment target is edge devices like phones and laptops, benefiting from its compact size and quantization support for low-latency, low-memory inference. Demonstrations such as Mobile Actions and Tiny Garden showcase its ability to perform complex tasks on-device without server calls, achieving up to 85% accuracy after fine-tuning. This development signifies a step forward in creating efficient, localized AI solutions that can operate independently of cloud infrastructure, crucial for privacy and real-time applications.

Posted on

by

in

Topics: AI advancements, Privacy, quantization

Rodeo: AI-Powered App for Planning with Friends

Rodeo is a new app designed to simplify the process of planning activities with existing friends by utilizing AI technology. Founded by former Hinge executives, the app addresses the common struggle of organizing social events amidst busy schedules filled with work and family commitments. Rodeo can transform social media posts, event ads, or group chat screenshots into actionable plans by integrating details like venues and showtimes, and even facilitating ticket purchases. Users can create and share collaborative lists for future activities, making it easier to coordinate with friends. While the app leverages AI to streamline these processes, its founders have chosen not to heavily market this feature, recognizing that many users prefer AI to remain unobtrusive in their personal lives. Currently in an invite-only beta phase, Rodeo aims to tap into the growing demand for organizational tools similar to Notion and Obsidian, positioning itself as a "second brain" for social planning. This matters because it offers a novel solution to the common challenge of maintaining friendships in a busy world by using technology to simplify and enhance social coordination.