256K context

  • LGAI-EXAONE/K-EXAONE-236B-A23B-GGUF Model Overview


    LGAI-EXAONE/K-EXAONE-236B-A23B-GGUF · Hugging Face

    The LGAI-EXAONE/K-EXAONE-236B-A23B-GGUF model is a highly efficient AI architecture with 236 billion total parameters, of which 23 billion are active per token, and it uses Multi-Token Prediction (MTP) to boost inference throughput. It supports a 256K context window through a hybrid attention scheme that significantly reduces memory usage for long-document processing. The model offers multilingual support across six languages with an improved 150K-entry vocabulary for better token efficiency, and it demonstrates advanced tool-use and search capabilities through multi-agent strategies. It is also aligned with universal human values and incorporates Korean cultural context to address regional sensitivities, aiming for high reliability across diverse risk categories. This matters because it represents a notable advance in efficiency, multilingual capability, and cultural sensitivity, with potential impact across many applications and industries.

    Read Full Article: LGAI-EXAONE/K-EXAONE-236B-A23B-GGUF Model Overview
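    For illustration, here is a minimal sketch of loading a GGUF checkpoint like this one with a large context window via llama-cpp-python. The repo ID comes from the article; the exact .gguf filename, quantization level, and hardware settings are assumptions and may differ.

    ```python
    # Minimal sketch, assuming llama-cpp-python and huggingface_hub are installed.
    from huggingface_hub import hf_hub_download
    from llama_cpp import Llama

    model_path = hf_hub_download(
        repo_id="LGAI-EXAONE/K-EXAONE-236B-A23B-GGUF",
        filename="K-EXAONE-236B-A23B-Q4_K_M.gguf",  # hypothetical filename/quant
    )

    llm = Llama(
        model_path=model_path,
        n_ctx=262_144,    # request the full 256K-token context window
        n_gpu_layers=-1,  # offload all layers to GPU if memory allows
    )

    out = llm("Summarize the following contract in Korean and English:\n...", max_tokens=512)
    print(out["choices"][0]["text"])
    ```

    Note that a model with 23 billion active parameters (236 billion total) at a 256K context still needs substantial memory even in 4-bit quantization, so a smaller n_ctx is a practical starting point.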

  • AI21 Labs Unveils Jamba2 Mini Model


    AI21 Labs releases Jamba2

    AI21 Labs has launched Jamba2, a series of open-source language models designed for enterprise use, including Jamba2 Mini with 52 billion parameters. The model is optimized for precise question answering and pairs a lean memory footprint with a 256K context window, making it suitable for processing large documents such as technical manuals and research papers. Jamba2 Mini scores strongly on benchmarks such as IFBench and FACTS, demonstrating reliability and performance on real-world enterprise tasks. Released under the Apache 2.0 License, it is fully open source for commercial use and positioned as a scalable, production-optimized option. Why this matters: Jamba2 gives businesses a powerful and efficient tool for complex language tasks, improving productivity and accuracy in enterprise environments.

    Read Full Article: AI21 Labs Unveils Jamba2 Mini Model
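    As a rough illustration of how a 256K-context model like this slots into a long-document QA workflow, here is a minimal sketch using Hugging Face transformers. The repo ID "ai21labs/AI21-Jamba2-Mini" is an assumption (check AI21's hub page for the actual identifier), and the document path and question are placeholders.

    ```python
    # Minimal sketch, assuming a chat-tuned checkpoint with a chat template.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "ai21labs/AI21-Jamba2-Mini"  # hypothetical repo ID
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        device_map="auto",
    )

    manual = open("technical_manual.txt").read()  # placeholder long document
    messages = [{
        "role": "user",
        "content": f"{manual}\n\nWhat torque does section 4.2 specify for the flange bolts?",
    }]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)

    output = model.generate(inputs, max_new_tokens=256)
    print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
    ```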

  • Falcon-H1R-7B: Compact Model Excels in Reasoning


    TII Abu Dhabi Released Falcon-H1R-7B: A New Reasoning Model Outperforming Others in Math and Coding with Only 7B Params and a 256K Context Window

    The Technology Innovation Institute in Abu Dhabi has introduced Falcon-H1R-7B, a compact 7-billion-parameter model that excels at math, coding, and general reasoning, outperforming larger models with up to 47 billion parameters. It employs a hybrid architecture that combines Transformer layers with Mamba2 components, enabling efficient long-sequence processing with a context window of up to 256K tokens. Training proceeds in two stages, supervised fine-tuning followed by reinforcement learning, which strengthens its reasoning capabilities. Falcon-H1R-7B posts high scores on math and coding benchmarks and delivers significant gains in throughput and accuracy through its hybrid design. This matters because it shows how smaller, well-designed models can rival much larger ones, offering more efficient solutions for complex reasoning tasks.

    Read Full Article: Falcon-H1R-7B: Compact Model Excels in Reasoning
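    To see why hybrid Transformer-plus-Mamba2 stacks matter at 256K tokens, here is a back-of-envelope sketch of KV-cache memory, which grows linearly with sequence length but only for the attention layers. All architecture numbers below are illustrative assumptions, not the published Falcon-H1R-7B configuration.

    ```python
    # Minimal sketch: KV-cache size for attention layers at a 256K context.
    def kv_cache_gib(n_attn_layers, n_kv_heads=8, head_dim=128,
                     seq_len=262_144, bytes_per_elem=2):
        """KV-cache size in GiB for one sequence (keys + values, bf16)."""
        elems = 2 * n_attn_layers * n_kv_heads * head_dim * seq_len  # K and V
        return elems * bytes_per_elem / 2**30

    # Every layer uses attention vs. only a quarter of layers (hypothetical split).
    print(f"pure attention:         {kv_cache_gib(n_attn_layers=32):.0f} GiB")  # ~32 GiB
    print(f"hybrid (8 attn layers): {kv_cache_gib(n_attn_layers=8):.0f} GiB")   # ~8 GiB
    ```

    Mamba2 layers carry a fixed-size state instead of a per-token cache, which is why such hybrids can hold 256K-token sequences in far less memory than a pure-attention model of the same depth.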