Close Menu
Savannah HeraldSavannah Herald
    • Home
    • News
      • Local
      • State
      • National
      • World
      • HBCUs
    • Events
    • Directories
    • Weather
    • Traffic
    • Sports
    • Politics
    • Lifestyle
      • Faith
      • Senior Living
      • Health
      • Travel
      • Beauty
      • Fashion
      • Food
      • Art & Literature
    • Business
      • Real Estate
      • Entertainment
      • Investing
      • Education
    • Guides
      • Summer Camp Guide
      • Juneteenth Guide
      • Black History Savannah
      • MLK Guide Savannah
    We're Social
    • Twitter
    • Facebook
    • YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Trending
    • Calling Retired Teachers! Your Retirement Benefits Continue. So Can Your Impact.
    • As the only Black woman on the Supreme Court, Ketanji Brown Jackson’s lone dissent in conversion therapy ruling stands out
    • Gullah Geechee people offered chance to save family properties passed down through generations | US news
    • Charleston’s Gullah Geechee Community Demand 7,000 Acres in Reparations
    • Best New Music This Week: Latto Releases Highly-Anticipated ‘Big Mama,’ WILLOW Wants To “Talk On The Hill,” Monaleo And More – Essence
    • Savannah State University Announces Historic $42.8 Million State Investment to Advance Student Wellness and Engagement
    • Foster Sylvers, singer behind ‘Misdemeanor’ and member of The Sylvers, dead at 64
    • Dua Lipa and Callum Turner Marry In London, Legally Affirming Status as Internet’s Favorite Couple
    Facebook X (Twitter) Instagram YouTube
    Login
    Savannah HeraldSavannah Herald
    Savannah HeraldSavannah Herald
    Home » MiniMax-M2 is the new king of open source LLMs (especially for agentic tool calling)
    Tech

    MiniMax-M2 is the new king of open source LLMs (especially for agentic tool calling)

    Savannah HeraldBy Savannah HeraldOctober 28, 20259 Mins Read
    Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    MiniMax-M2 is the new king of open source LLMs (especially for agentic tool calling)
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Tomorrow’s Tech, Today: Innovation That Moves Us Forward

    Key takeaways
    • MiniMax-M2 leads open-weight LLMs with top-tier agentic tool-calling and near-proprietary performance on τ²-Bench, BrowseComp, and FinSearchComp-global.
    • MIT-licensed release and API support for OpenAI/Anthropic standards enable enterprise-friendly deployment, retraining, and commercial use.
    • Efficient Mixture-of-Experts design (230B total, 10B active) delivers scalable agentic reasoning with lower GPU requirements and competitive token pricing.

    Watch out, DeepSeek and Qwen! There's a new king of open source large language models (LLMs), especially when it comes to something enterprises are increasingly valuing: agentic tool use — that is, the ability to go off and use other software capabilities like web search or bespoke applications — without much human guidance.

    That model is none other than MiniMax-M2, the latest LLM from the Chinese startup of the same name. And in a big win for enterprises globally, the model is available under a permissive, enterprise-friendly MIT License, meaning it is made available freely for developers to take, deploy, retrain, and use how they see fit — even for commercial purposes. It can be found on Hugging Face, GitHub and ModelScope, as well as through MiniMax's API here. It supports OpenAI and Anthropic API standards, as well, making it easy for customers of said proprietary AI startups to shift out their models to MiniMax's API, if they want.

    According to independent evaluations by Artificial Analysis, a third-party generative AI model benchmarking and research organization, M2 now ranks first among all open-weight systems worldwide on the Intelligence Index—a composite measure of reasoning, coding, and task-execution performance.

    In agentic benchmarks that measure how well a model can plan, execute, and use external tools—skills that power coding assistants and autonomous agents—MiniMax’s own reported results, following the Artificial Analysis methodology, show τ²-Bench 77.2, BrowseComp 44.0, and FinSearchComp-global 65.5.

    These scores place it at or near the level of top proprietary systems like GPT-5 (thinking) and Claude Sonnet 4.5, making MiniMax-M2 the highest-performing open model yet released for real-world agentic and tool-calling tasks.

    What It Means For Enterprises and the AI Race

    Built around an efficient Mixture-of-Experts (MoE) architecture, MiniMax-M2 delivers high-end capability for agentic and developer workflows while remaining practical for enterprise deployment.

    For technical decision-makers, the release marks an important turning point for open models in business settings. MiniMax-M2 combines frontier-level reasoning with a manageable activation footprint—just 10 billion active parameters out of 230 billion total.

    This design enables enterprises to operate advanced reasoning and automation workloads on fewer GPUs, achieving near-state-of-the-art results without the infrastructure demands or licensing costs associated with proprietary frontier systems.

    Artificial Analysis’ data show that MiniMax-M2’s strengths go beyond raw intelligence scores. The model leads or closely trails top proprietary systems such as GPT-5 (thinking) and Claude Sonnet 4.5 across benchmarks for end-to-end coding, reasoning, and agentic tool use.

    Its performance in τ²-Bench, SWE-Bench, and BrowseComp indicates particular advantages for organizations that depend on AI systems capable of planning, executing, and verifying complex workflows—key functions for agentic and developer tools inside enterprise environments.

    As LLM engineer Pierre-Carl Langlais aka Alexander Doria posted on X: "MiniMax [is] making a case for mastering the technology end-to-end to get actual agentic automation."

    Compact Design, Scalable Performance

    MiniMax-M2’s technical architecture is a sparse Mixture-of-Experts model with 230 billion total parameters and 10 billion active per inference.

    This configuration significantly reduces latency and compute requirements while maintaining broad general intelligence.

    The design allows for responsive agent loops—compile–run–test or browse–retrieve–cite cycles—that execute faster and more predictably than denser models.

    For enterprise technology teams, this means easier scaling, lower cloud costs, and reduced deployment friction. According to Artificial Analysis, the model can be served efficiently on as few as four NVIDIA H100 GPUs at FP8 precision, a setup well within reach for mid-size organizations or departmental AI clusters.

    Benchmark Leadership Across Agentic and Coding Workflows

    MiniMax’s benchmark suite highlights strong real-world performance across developer and agent environments. The figure below, released with the model, compares MiniMax-M2 (in red) with several leading proprietary and open models, including GPT-5 (thinking), Claude Sonnet 4.5, Gemini 2.5 Pro, and DeepSeek-V3.2.

    MiniMax-M2 achieves top or near-top performance in many categories:

    • SWE-bench Verified: 69.4 — close to GPT-5’s 74.9

    • ArtifactsBench: 66.8 — above Claude Sonnet 4.5 and DeepSeek-V3.2

    • τ²-Bench: 77.2 — approaching GPT-5’s 80.1

    • GAIA (text only): 75.7 — surpassing DeepSeek-V3.2

    • BrowseComp: 44.0 — notably stronger than other open models

    • FinSearchComp-global: 65.5 — best among tested open-weight systems

    These results show MiniMax-M2’s capability in executing complex, tool-augmented tasks across multiple languages and environments—skills increasingly relevant for automated support, R&D, and data analysis inside enterprises.

    Strong Showing in Artificial Analysis’ Intelligence Index

    The model’s overall intelligence profile is confirmed in the latest Artificial Analysis Intelligence Index v3.0, which aggregates performance across ten reasoning benchmarks including MMLU-Pro, GPQA Diamond, AIME 2025, IFBench, and τ²-Bench Telecom.

    MiniMax-M2 scored 61 points, ranking as the highest open-weight model globally and following closely behind GPT-5 (high) and Grok 4.

    Artificial Analysis highlighted the model’s balance between technical accuracy, reasoning depth, and applied intelligence across domains. For enterprise users, this consistency indicates a reliable model foundation suitable for integration into software engineering, customer support, or knowledge automation systems.

    Designed for Developers and Agentic Systems

    MiniMax engineered M2 for end-to-end developer workflows, enabling multi-file code edits, automated testing, and regression repair directly within integrated development environments or CI/CD pipelines.

    The model also excels in agentic planning—handling tasks that combine web search, command execution, and API calls while maintaining reasoning traceability.

    These capabilities make MiniMax-M2 especially valuable for enterprises exploring autonomous developer agents, data analysis assistants, or AI-augmented operational tools.

    Benchmarks such as Terminal-Bench and BrowseComp demonstrate the model’s ability to adapt to incomplete data and recover gracefully from intermediate errors, improving reliability in production settings.

    Interleaved Thinking and Structured Tool Use

    A distinctive aspect of MiniMax-M2 is its interleaved thinking format, which maintains visible reasoning traces between <think>…</think> tags.

    This enables the model to plan and verify steps across multiple exchanges, a critical feature for agentic reasoning. MiniMax advises retaining these segments when passing conversation history to preserve the model’s logic and continuity.

    The company also provides a Tool Calling Guide on Hugging Face, detailing how developers can connect external tools and APIs via structured XML-style calls.

    This functionality allows MiniMax-M2 to serve as the reasoning core for larger agent frameworks, executing dynamic tasks such as search, retrieval, and computation through external functions.

    Open Source Access and Enterprise Deployment Options

    Enterprises can access the model through the MiniMax Open Platform API and MiniMax Agent interface (a web chat similar to ChatGPT), both currently free for a limited time.

    MiniMax recommends SGLang and vLLM for efficient serving, each offering day-one support for the model’s unique interleaved reasoning and tool-calling structure.

    Deployment guides and parameter configurations are available through MiniMax’s documentation.

    Cost Efficiency and Token Economics

    As Artificial Analysis noted, MiniMax’s API pricing is set at $0.30 per million input tokens and $1.20 per million output tokens, among the most competitive in the open-model ecosystem.

    Provider

    Model (doc link)

    Input $/1M

    Output $/1M

    Notes

    MiniMax

    MiniMax-M2

    $0.30

    $1.20

    Listed under “Chat Completion v2” for M2.

    OpenAI

    GPT-5

    $1.25

    $10.00

    Flagship model pricing on OpenAI’s API pricing page.

    OpenAI

    GPT-5 mini

    $0.25

    $2.00

    Cheaper tier for well-defined tasks.

    Anthropic

    Claude Sonnet 4.5

    $3.00

    $15.00

    Anthropic’s current per-MTok list; long-context (>200K input) uses a premium tier.

    Google

    Gemini 2.5 Flash (Preview)

    $0.30

    $2.50

    Prices include “thinking tokens”; page also lists cheaper Flash-Lite and 2.0 tiers.

    xAI

    Grok-4 Fast (reasoning)

    $0.20

    $0.50

    “Fast” tier; xAI also lists Grok-4 at $3 / $15.

    DeepSeek

    DeepSeek-V3.2 (chat)

    $0.28

    $0.42

    Cache-hit input is $0.028; table shows per-model details.

    Qwen (Alibaba)

    qwen-flash (Model Studio)

    from $0.022

    from $0.216

    Tiered by input size (≤128K, ≤256K, ≤1M tokens); listed “Input price / Output price per 1M”.

    Cohere

    Command R+ (Aug 2024)

    $2.50

    $10.00

    First-party pricing page also lists Command R ($0.50 / $1.50) and others.

    Notes & caveats (for readers):

    • Prices are USD per million tokens and can change; check linked pages for updates and region/endpoint nuances (e.g., Anthropic long-context >200K input, Google Live API variants, cache discounts).

    • Vendors may bill extra for server-side tools (web search, code execution) or offer batch/context-cache discounts.

    While the model produces longer, more explicit reasoning traces, its sparse activation and optimized compute design help maintain a favorable cost-performance balance—an advantage for teams deploying interactive agents or high-volume automation systems.

    Background on MiniMax — an Emerging Chinese Powerhouse

    MiniMax has quickly become one of the most closely watched names in China’s fast-rising AI sector.

    Backed by Alibaba and Tencent, the company moved from relative obscurity to international recognition within a year—first through breakthroughs in AI video generation, then through a series of open-weight large language models (LLMs) aimed squarely at developers and enterprises.

    The company first captured global attention in late 2024 with its AI video generation tool, “video-01,” which demonstrated the ability to create dynamic, cinematic scenes in seconds. VentureBeat described how the model’s launch sparked widespread interest after online creators began sharing lifelike, AI-generated footage—most memorably, a viral clip of a Star Wars lightsaber duel that drew millions of views in under two days.

    CEO Yan Junjie emphasized that the system outperformed leading Western tools in generating human movement and expression, an area where video AIs often struggle. The product, later commercialized through MiniMax’s Hailuo platform, showcased the startup’s technical confidence and creative reach, helping to establish China as a serious contender in generative video technology.

    By early 2025, MiniMax had turned its attention to long-context language modeling, unveiling the MiniMax-01 series, including MiniMax-Text-01 and MiniMax-VL-01. These open-weight models introduced an unprecedented 4-million-token context window, doubling the reach of Google’s Gemini 1.5 Pro and dwarfing OpenAI’s GPT-4o by more than twentyfold.

    The company continued its rapid cadence with the MiniMax-M1 release in June 2025, a model focused on long-context reasoning and reinforcement learning efficiency. M1 extended context capacity to 1 million tokens and introduced a hybrid Mixture-of-Experts design trained using a custom reinforcement-learning algorithm known as CISPO. Remarkably, VentureBeat reported that MiniMax trained M1 at a total cost of about $534,700, roughly one-tenth of DeepSeek’s R1 and far below the multimillion-dollar budgets typical for frontier-scale models.

    For enterprises and technical teams, MiniMax’s trajectory signals the arrival of a new generation of cost-efficient, open-weight models designed for real-world deployment. Its open licensing—ranging from Apache 2.0 to MIT—gives businesses freedom to customize, self-host, and fine-tune without vendor lock-in or compliance restrictions.

    Features such as structured function calling, long-context retention, and high-efficiency attention architectures directly address the needs of engineering groups managing multi-step reasoning systems and data-intensive pipelines.

    As MiniMax continues to expand its lineup, the company has emerged as a key global innovator in open-weight AI, combining ambitious research with pragmatic engineering.

    Open-Weight Leadership and Industry Context

    The release of MiniMax-M2 reinforces the growing leadership of Chinese AI research groups in open-weight model development.

    Following earlier contributions from DeepSeek, Alibaba’s Qwen series, and Moonshot AI, MiniMax’s entry continues the trend toward open, efficient systems designed for real-world use.

    Artificial Analysis observed that MiniMax-M2 exemplifies a broader shift in focus toward agentic capability and reinforcement-learning refinement, prioritizing controllable reasoning and real utility over raw model size.

    For enterprises, this means access to a state-of-the-art open model that can be audited, fine-tuned, and deployed internally with full transparency.

    By pairing strong benchmark performance with open licensing and efficient scaling, MiniMaxAI positions MiniMax-M2 as a practical foundation for intelligent systems that think, act, and assist with traceable logic—making it one of the most enterprise-ready open AI models available today.

    Read the full article on the original site


    AI and Machine Learning Black Technologists Cybersecurity News Digital Innovation Emerging Technologies Future of Work Gadget Reviews Innovation in Education Minorities in Tech Silicon Valley Updates Smart Devices Software Development Startup News STEM News Tech Culture Tech Equity Tech for Good Tech Industry Updates Tech Trends Technology News
    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    Savannah Herald
    • Website

    Related Posts

    Tech June 1, 2026

    MUSIC MONDAY: “The Ultimate James Brown Collection” Playlist (LISTEN) – Good Black News

    Tech May 31, 2026

    Intel makes a bid for handheld gaming PCs with new Arc G3 processors

    Tech May 31, 2026

    Why Are Nuclear Fuel Rods Thrown Out With 90% Of Their Potential Power Left?

    Tech May 30, 2026

    Meta plans to make an AI pendant and more smart glasses soon

    Tech May 30, 2026

    Biohacking Implants: When Human Optimization Becomes Risky

    Tech May 29, 2026

    How ‘Find My Phone’ Still Works Even When Your Battery Is Dead

    Comments are closed.

    Don't Miss
    Gaming March 6, 2026By Savannah Herald03 Mins Read

    MSI unveils a lobster-like PC with a 13.3-inch touchscreen, RTX 5080X, and a quirky design that defies all conventions

    March 6, 2026

    Game On: Latest in Gaming News, Reviews & Industry Buzz MSI MEG Vision X AI…

    Devil May Cry 5 Sales Top 10 Million Units

    August 28, 2025

    Howard Alum Bryan Cook Signs $40M Bengals Deal

    March 26, 2026

    Some Alabama residents learn their water hasn’t had fluoride for years

    March 28, 2026

    Lady Tigers Drop Road Contest To Lane

    November 25, 2025
    Archives
    • June 2026
    • May 2026
    • April 2026
    • March 2026
    • February 2026
    • January 2026
    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    Categories
    • Art & Literature
    • Beauty
    • Black History
    • Business
    • Climate
    • Culture
    • Education
    • Employment
    • Entertainment
    • Faith
    • Fashion
    • Food
    • Gaming
    • Georgia Politics
    • HBCUs
    • Health
    • Health Inspections
    • Investing
    • Lifestyle
    • Local
    • Lowcountry News
    • National
    • National Opinion
    • News
    • Politics
    • Real Estate
    • Senior Living
    • Sports
    • State
    • Tech
    • Transportation
    • Travel
    • World
    Savannah Herald Newsletter

    Subscribe to Updates

    A round up interesting pic’s, post and articles in the C-Port and around the world.

    About Us
    About Us

    The Savannah Herald is your trusted source for the pulse of Coastal Georgia and the Low County of South Carolina. We're committed to delivering timely news that resonates with the African American community.

    From local politics to business developments, we're here to keep you informed and engaged. Our mission is to amplify the voices and stories that matter, shining a light on our collective experiences and achievements.
    We cover:
    🏛️ Politics
    💼 Business
    🎭 Entertainment
    🏀 Sports
    🩺 Health
    💻 Technology
    Savannah Herald: Savannah's Black Voice 💪🏾

    Our Picks

    Leading VA Conveniences That Assistance Professionals and Their Caretakers

    November 3, 2025

    HBCU Football Returns to ABC Primetime with 2025 MEAC/SWAC Challenge

    August 28, 2025

    Bedner cucumbers are back, making individuals ill once more

    May 2, 2026

    Bret Dunn to compete State Us senate Area 42 

    August 28, 2025

    The Doux Bananas Xtreme Hold Has Me In A Chokehold!

    August 28, 2025
    Categories
    • Art & Literature
    • Beauty
    • Black History
    • Business
    • Climate
    • Culture
    • Education
    • Employment
    • Entertainment
    • Faith
    • Fashion
    • Food
    • Gaming
    • Georgia Politics
    • HBCUs
    • Health
    • Health Inspections
    • Investing
    • Lifestyle
    • Local
    • Lowcountry News
    • National
    • National Opinion
    • News
    • Politics
    • Real Estate
    • Senior Living
    • Sports
    • State
    • Tech
    • Transportation
    • Travel
    • World
    Copyright © 2002-2026 Savannahherald.com All Rights Reserved. A Veteran-Owned Business

    Type above and press Enter to search. Press Esc to cancel.

    Manage Consent
    To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}
    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.

    Sign In or Register

    Welcome Back!

    Login below or Register Now.

    Lost password?

    Register Now!

    Already registered? Login.

    A password will be e-mailed to you.