Close Menu
Savannah HeraldSavannah Herald
    • Home
    • News
      • Local
      • State
      • National
      • World
      • HBCUs
    • Events
    • Directories
    • Weather
    • Traffic
    • Sports
    • Politics
    • Lifestyle
      • Faith
      • Senior Living
      • Health
      • Travel
      • Beauty
      • Fashion
      • Food
      • Art & Literature
    • Business
      • Real Estate
      • Entertainment
      • Investing
      • Education
    • Guides
      • Summer Camp Guide
      • Juneteenth Guide
      • Black History Savannah
      • MLK Guide Savannah
    We're Social
    • Twitter
    • Facebook
    • YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Trending
    • Karl-Anthony Towns says he felt late mother’s presence in NBA Finals Game 1
    • Nick Bilton, New ‘60 Minutes’ Chief, Pledges Independence
    • Perfect Vegan Strawberry Muffins | Jessica in the Kitchen
    • Deadly Listeria outbreak traced to Clover Hill cheese
    • 9 Best Brown Mascaras for When Black Feels Like Too Much
    • Witchcraft as Spiritual Activism by Freia Serafina and Amie Ritchie – Feminism and Religion
    • Alo Outfit Inspo & New Colors of the Season » coco bassey
    • A Story Not Really About Racism, But Maybe?
    Facebook X (Twitter) Instagram YouTube
    Login
    Savannah HeraldSavannah Herald
    Savannah HeraldSavannah Herald
    Home » Software engineer on the real state of AI agents (they’re not there yet)
    Tech

    Software engineer on the real state of AI agents (they’re not there yet)

    Savannah HeraldBy Savannah HeraldSeptember 3, 20255 Mins Read
    Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    Software engineer on the real state of AI agents (they
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Tomorrow’s Tech, Today: Innovation That Moves Us Forward

    Key takeaways
    • Utkarsh Kanwat warns compounded error rates make long autonomous workflows unreliable; optimistic per-step reliability of LLMs still collapses success.
    • Design agents with 3 to 5 discrete, verifiable steps, explicit rollback points, and human confirmation gates.
    • Conversational agents incur quadratic token costs; stateless function-generation (description in, function out) avoids context and runaway expenses.
    • Tool design is often underestimated; tools must return compact, structured feedback so agents stay within limited context windows.
    • Enterprise constraints like legacy APIs, rate limits, and compliance demand connection pooling, transaction rollbacks, and audit logging managed by conventional engineering.

    A hot potato: Amid growing hype around AI agents, one experienced engineer has brought a grounded perspective shaped by work on more than a dozen production-level systems spanning development, DevOps, and data operations. From his vantage point, the notion that 2025 will bring truly autonomous workforce-transforming agents looks increasingly unrealistic.

    In a recent blog post, systems engineer Utkarsh Kanwat points to fundamental mathematical constraints that challenge the notion of fully autonomous multi-step agent workflows. Since production-grade systems require upwards of 99.9 percent reliability, the math quickly makes extended autonomous workflows unfeasible.

    “If each step in an agent workflow has 95 percent reliability, which is optimistic for current LLMs, five steps yield 77 percent success, 10 steps 59 percent, and 20 steps only 36 percent,” Kanwat explained.

    Even hypothetically improved per-step reliability of 99 percent falls short at about 82 percent success for 20 steps.

    “This isn’t a prompt engineering problem. This isn’t a model capability problem. This is mathematical reality,” Kanwat says.

    Kanwat’s DevOps agent avoids the compounded error problem by breaking workflows into 3 to 5 discrete, independently verifiable steps, each with explicit rollback points and human confirmation gates. This design approach – emphasizing bounded contexts, atomic operations, and optional human intervention at critical junctures – forms the foundation of every reliable agent system he has built. He warns that attempting to chain too many autonomous steps inevitably leads to failure due to compounded error rates.

    Token cost scaling in conversational agents presents a second, often overlooked barrier. Kanwat illustrates this through his experience prototyping a conversational database agent, where each new interaction had to process the full previous context – causing token costs to scale quadratically with conversation length.

    In one case, a 100-turn exchange cost between $50 and $100 in tokens alone, making widespread use economically unsustainable. Kanwat’s function-generation agent sidestepped the issue by remaining stateless: description in, function out – no context to maintain, no conversation to track, and no runaway costs.

    “The most successful ‘agents’ in production aren’t conversational at all,” Kanwat says. “They’re smart, bounded tools that do one thing well and get out of the way.”

    Beyond the mathematical constraints lies a deeper engineering challenge: tool design. Kanwat argues this aspect is often underestimated amid the broader hype around agents. While tool invocation has become relatively precise, he says the real difficulty lies in designing tools that provide structured, actionable feedback without overwhelming the agent’s limited context window.

    For example, a well-designed database tool should summarize results in a compact, digestible format – indicating that a query succeeded, returned 10 thousand results, and displaying only a handful – rather than overwhelming the agent with raw output. Handling partial success, recovery from failure, and managing interdependent operations further increases the engineering complexity.

    “My database agent works not because the tool calls are unreliable,” Kanwat says, “but because I spent weeks designing tools that communicate effectively with the AI.”

    Kanwat critiques companies that promote simplistic “just connect your APIs” solutions, saying they often design tools for humans rather than for AI systems. As a result, agents may be able to call APIs, but they frequently fail to manage real workflows due to a lack of structured communication and contextual awareness.

    Kanwat notes that enterprise environments seldom provide clean APIs for AI agents. Legacy constraints, fluctuating rate limits, and strict compliance requirements all pose significant hurdles. His database agent, for instance, incorporates traditional engineering features like connection pooling, transaction rollbacks, query timeouts, and detailed audit logging – elements that fall far outside the AI’s scope.

    He emphasizes that the agent generates queries while conventional systems programming manages everything else. In his view, many companies pushing the promise of fully autonomous, full-stack agents fail to reckon with these harsh realities. The real challenge, he argues, is not AI capability but integration – and that’s where most agents fall apart.

    Kanwat’s successful agents share a common approach: AI manages complexity within clear boundaries, while humans or deterministic systems ensure control and reliability. His UI generation agent creates React components but requires human review before deployment. DevOps automation produces Terraform code that undergoes review, version control, and rollback. The CI/CD agent includes defined success criteria and rollback procedures, and the database agent confirms destructive commands before execution. This design lets AI handle the “hard parts” while preserving human oversight and traditional engineering to maintain safety and correctness.

    Looking ahead, Kanwat predicts that venture-backed startups chasing fully autonomous agents will struggle due to economic constraints and accumulating errors. Meanwhile, enterprises attempting to integrate AI with legacy software will face adoption hurdles because of complex integration issues. He believes the most successful teams will concentrate on creating specialized, domain-focused tools that apply AI to complex tasks but retain human oversight or strict operational limits. Kanwat also cautions that many companies will face a steep learning curve moving from impressive demonstrations to dependable, market-ready products.

    Read the full article on the original site


    AI and Machine Learning Black Technologists Cybersecurity News Digital Innovation Emerging Technologies Future of Work Gadget Reviews Innovation in Education Minorities in Tech Silicon Valley Updates Smart Devices Software Development Startup News STEM News Tech Culture Tech Equity Tech for Good Tech Industry Updates Tech Trends Technology News
    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Tumblr Email
    Savannah Herald
    • Website

    Related Posts

    Tech June 4, 2026

    Denken Sie über einen Wechsel Ihres IT-Servicemanagement-Tool nach?  

    Tech June 3, 2026

    U.K. Prime Minister Condemns Violent Protests as Police Face Criticism Over Handcuffed Student’s Murder

    Tech June 3, 2026

    Apple’s Excellent 11-Inch iPad Is Now Just $299.99 In Your Favorite Colors

    Tech June 2, 2026

    Roids were all the rage at the Enhanced Games

    Tech June 2, 2026

    An AI Career Upgrade, Your Guaranteed Next Role

    Tech June 1, 2026

    MUSIC MONDAY: “The Ultimate James Brown Collection” Playlist (LISTEN) – Good Black News

    Comments are closed.

    Don't Miss
    Local November 25, 2025By Savannah Herald02 Mins Read

    Coastal Health District Hosts December Events for World AIDS Day 2025

    November 25, 2025

    Nonprofit Spotlight – Making a Difference in Our Community: In honor of World AIDS Day,…

    Derenne Middle Assistant Principal Awarded Lead4Change Fellowship

    August 6, 2025

    ‘Jerry Maguire’ agent Leigh Steinberg defends Jaxson Dart’s Trump rally

    May 28, 2026

    Virgil’s Gullah Kitchen co-owner, LGBTQ+ advocate Gee Smalls shares his life story in a new audio version of his memoir

    August 28, 2025

    Beauty Buy | Maybelline Super Stay Teddy Tint™

    August 28, 2025
    Archives
    • June 2026
    • May 2026
    • April 2026
    • March 2026
    • February 2026
    • January 2026
    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    Categories
    • Art & Literature
    • Beauty
    • Black History
    • Business
    • Climate
    • Culture
    • Education
    • Employment
    • Entertainment
    • Faith
    • Fashion
    • Food
    • Gaming
    • Georgia Politics
    • HBCUs
    • Health
    • Health Inspections
    • Investing
    • Lifestyle
    • Local
    • Lowcountry News
    • National
    • National Opinion
    • News
    • Politics
    • Real Estate
    • Senior Living
    • Sports
    • State
    • Tech
    • Transportation
    • Travel
    • World
    Savannah Herald Newsletter

    Subscribe to Updates

    A round up interesting pic’s, post and articles in the C-Port and around the world.

    About Us
    About Us

    The Savannah Herald is your trusted source for the pulse of Coastal Georgia and the Low County of South Carolina. We're committed to delivering timely news that resonates with the African American community.

    From local politics to business developments, we're here to keep you informed and engaged. Our mission is to amplify the voices and stories that matter, shining a light on our collective experiences and achievements.
    We cover:
    🏛️ Politics
    💼 Business
    🎭 Entertainment
    🏀 Sports
    🩺 Health
    💻 Technology
    Savannah Herald: Savannah's Black Voice 💪🏾

    Our Picks

    8 Underrated Patti LaBelle Songs Every Songs Fan Ought To Listen To.– ThyBlackMan.com

    August 28, 2025

    Mike Bailey sets sights on AEW World Championship after ‘Dynamite’ win, gets support from Kevin Knight

    May 14, 2026

    Highschool volleyball: Wednesday’s boys’ Metropolis Part playoff outcomes, pairings

    August 29, 2025

    In Newark, the Healing Power of Food and Community

    May 14, 2026

    Thousands protest crime and corruption in Mexico City as ‘Gen Z’ protests gain momentum

    February 28, 2026
    Categories
    • Art & Literature
    • Beauty
    • Black History
    • Business
    • Climate
    • Culture
    • Education
    • Employment
    • Entertainment
    • Faith
    • Fashion
    • Food
    • Gaming
    • Georgia Politics
    • HBCUs
    • Health
    • Health Inspections
    • Investing
    • Lifestyle
    • Local
    • Lowcountry News
    • National
    • National Opinion
    • News
    • Politics
    • Real Estate
    • Senior Living
    • Sports
    • State
    • Tech
    • Transportation
    • Travel
    • World
    Copyright © 2002-2026 Savannahherald.com All Rights Reserved. A Veteran-Owned Business

    Type above and press Enter to search. Press Esc to cancel.

    Manage Consent
    To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}
    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.

    Sign In or Register

    Welcome Back!

    Login below or Register Now.

    Lost password?

    Register Now!

    Already registered? Login.

    A password will be e-mailed to you.