Articles from Source: The-New-Stack

How GitHub plans to win developers back

2026-06-02 17:56
GitHub has faced significant challenges over the past year, with frequent outages affecting key functionalities like search and CI/CD pipelines. 🚧 In an interview, COO Kyle Daigle highlighted the unprecedented growth in developer activity, prompting the need for drastic scaling efforts. GitHub is adapting to handle up to 30 times more commits and pull requests. 📈 To address this, the platform is migrating to Microsoft’s Azure cloud and enhancing its infrastructure. 🌐 #GitHub...
Source: The New Stack
Frederic Lardinois

Microsoft really, really, really wants developers to love Windows again

2026-06-02 17:56
🚀 At the recent Build developer conference, Microsoft unveiled new features aimed at enhancing the developer experience on Windows 11. The standout addition is a default dark mode, along with a streamlined OS that minimizes distractions. Over 30 settings are optimized for developers to improve workflow and productivity. 🖥️ Jatinder Mann from Microsoft emphasized the importance of listening to developer feedback to create a fast, distraction-free environment. The new configuration also...
Source: The New Stack
Frederic Lardinois

With Intelligent Terminal, Microsoft is reinventing the Windows terminal

2026-06-02 17:56
Microsoft is set to transform the Windows terminal with its new **Intelligent Terminal** feature. This innovative tool aims to integrate coding agents like GitHub Copilot directly into the terminal, enhancing the developer experience. Instead of switching between windows to troubleshoot errors, developers can interact with their agents within the terminal itself. The Intelligent Terminal recognizes errors and provides suggestions with just a click. Customization options allow developers to...
Source: The New Stack
Frederic Lardinois

Microsoft debuts “Scout” at Build, a new personal agent for work

2026-06-02 17:56
🚀 At Microsoft Build, the company introduced Microsoft Scout, a new personal agent designed to enhance work efficiency. Scout uses existing tools like Teams and Outlook to understand workflows and handle routine tasks proactively, such as resolving scheduling conflicts and preparing for meetings. Scout is currently available for Frontier customers; further details and a broader rollout will follow. Microsoft emphasizes the need for agents that reflect how users operate in their work environments....
Source: The New Stack
Meredith Shubel

OpenAI’s Codex adds new tools — Sites, Annotations, more plugins — for knowledge workers

2026-06-02 16:00
🚀 OpenAI has announced that 20% of Codex's 5 million weekly users are now knowledge workers. To cater to this growing audience, new features have been introduced, including Sites and Annotations, along with specialized plugins. 🖥️ **Sites** allows users to create and share interactive websites tailored to their work. 📝 **Annotations** enables users to edit specific document sections directly within Codex. These updates aim to enhance workplace productivity and collaboration. #OpenAI #Codex...
Source: The New Stack
Frederic Lardinois

GitHub Copilot’s usage-based billing is live: Here’s what you need to know

2026-06-02 15:07
🚀 GitHub has launched a new usage-based billing system for Copilot, officially retiring the premium request model. Now, payments are linked directly to usage with a token-based system. Each plan offers GitHub AI Credits based on token consumption, which includes input and output data. 💡 Prices remain the same, but Pro users get $15 in credits, while Pro+ users receive $70 monthly. This change aims to adapt to the evolving costs of AI infrastructure. #GitHub #Copilot #AIBilling #TechNews...
Source: The New Stack
Paul Sawers
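
To make the token-to-credit accounting in the Copilot billing item above concrete, here is a minimal sketch. The monthly credit amounts ($15 for Pro, $70 for Pro+) come from the summary; the per-token rates, the plan keys, and the function name are hypothetical placeholders chosen for illustration, not GitHub's published prices or API.

```python
from decimal import Decimal

# Monthly credit allowances quoted in the summary above (USD).
MONTHLY_CREDITS = {"pro": Decimal("15"), "pro_plus": Decimal("70")}

# HYPOTHETICAL rates in USD per one million tokens; not real prices.
ASSUMED_INPUT_RATE = Decimal("3")
ASSUMED_OUTPUT_RATE = Decimal("15")

MILLION = Decimal(1_000_000)


def credits_remaining(plan, input_tokens, output_tokens):
    """Return the credits left after charging input and output tokens."""
    cost = (
        Decimal(input_tokens) * ASSUMED_INPUT_RATE
        + Decimal(output_tokens) * ASSUMED_OUTPUT_RATE
    ) / MILLION
    return MONTHLY_CREDITS[plan] - cost
```

Under these assumed rates, a negative return value would mean the month's usage exceeded the plan's included credits. `Decimal` is used instead of floats so that currency arithmetic stays exact.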

OpenAI, Anthropic, Google, Amazon, and xAI all fail on type of attack, study finds

2026-06-01 21:01
Recent research by Cisco reveals that AI safety benchmarks may not accurately assess model performance. The study evaluated 15 models from OpenAI, Anthropic, Google, Amazon, and xAI. Key findings show that all models struggled in multi-turn attacks, with success rates varying significantly. Single-turn assessments do not reliably predict multi-turn resilience, highlighting a critical gap in current evaluation methods. Interestingly, while Anthropic's Claude family performed best in multi-turn...
Source: The New Stack
Darryl K. Taft

JetBrains open-sources Mellum2 to go where Claude Code can’t

2026-06-01 20:47
🚀 JetBrains has open-sourced Mellum2, a 12B-parameter coding model designed for agentic AI infrastructure tasks. This model expands on the original Mellum, which focused solely on code completion. Mellum2 is versatile, handling coordination, retrieval pipelines, and on-premises deployment needs. Two variants also launch with Mellum2: an “instruct” version for direct answers and a “thinking” version for complex tasks. Built with a Mixture-of-Experts architecture, it maintains speed and...
Source: The New Stack
Paul Sawers

Claude Code vs. Cursor vs. Codex vs. Antigravity — six months in

2026-06-01 17:20
In the past six months, the landscape of agentic coding tools has shifted significantly. Four key players, Claude Code, Cursor, Codex, and Antigravity, have defined the market. 🔍 Claude Code remains focused on terminal use, emphasizing long-context reasoning for large codebases. Its careful approach demands developer oversight at critical moments. 🖥️ Cursor offers flexibility, integrating with existing workflows without requiring migration. This model-agnostic tool allows developers to work...
Source: The New Stack
Janakiram MSV

This coding agent doesn’t want your feedback — it ships without it

2026-06-01 16:00
SkipLabs has launched Skipper, a closed-loop coding agent that automates backend service creation from plain-language descriptions or OpenAPI specs. Unlike other AI tools, Skipper eliminates the review cycle, functioning independently to generate and validate code. Founder Julien Verlaguet emphasizes that this approach addresses the architectural challenges of software development. With Skipper, developers can describe their needs, and the tool handles the rest. #AI #Coding...
Source: The New Stack
Darryl K. Taft

“Blowing things up”: The one move vendors got wrong on AI agents

2026-06-01 15:00
AI agents are only as effective as the context they are given, according to Hyland CEO Jitesh Ghai. At the CommunityLIVE 2026 conference, Ghai emphasized that businesses don’t need to completely overhaul their processes to integrate AI. Instead, he advocates for leveraging existing systems and data to provide necessary context. Other vendors like OpenText and Box also recognize the importance of context for enterprise AI applications. #AIAgents #EnterpriseSoftware #Hyland #ContextMatters...
Source: The New Stack
Frederic Lardinois

At Sapphire, SAP makes the case that enterprise AI is a context problem

2026-06-01 15:00
At SAP Sapphire 2026 in Orlando, SAP emphasized that winning in enterprise AI requires more than just advanced chatbots or models. It's about providing agents with the right business context, data access, and governance. 🤖📊 SAP introduced the SAP Business AI Platform, integrating various services to enhance the "Autonomous Enterprise." This includes over 50 domain-specific Joule Assistants for diverse business functions. SAP's focus is on leveraging its ERP strengths to create a robust AI...
Source: The New Stack
Frederic Lardinois

Gavriel Cohen found his own code inside OpenClaw, so he walked away

2026-05-31 17:00
Gavriel Cohen's journey with OpenClaw began with excitement, as he sought to enhance his AI marketing project. However, he quickly discovered issues within the code, among them his own package, NanoPDF, which had been included without his knowledge. His experience revealed significant security concerns and a chaotic code base, with over 3,000 unresolved pull requests. This led him to walk away from OpenClaw. 🔍💻⚠️ #OpenSource #AI #Programming #Security #TechNews
Source: The New Stack
David Eastman

AI retrieval at scale is becoming a systems problem, not a tooling problem

2026-05-31 16:00
AI retrieval is evolving beyond mere embeddings and vector search. Early methods focused on semantic similarity, but now, production applications require a mix of keyword matching, ranking, and real-time signals. As systems become more complex, maintaining performance and simplicity is essential for large-scale relevance. GigaOm’s research highlights that fragmentation in retrieval architectures can slow progress and increase operational overhead. Consolidation is now viewed as a crucial...
Source: The New Stack
Tim Young

The DIY platform trap that’s burning out engineering teams

2026-05-31 15:00
🚧 Platform engineers are known for their problem-solving skills, often automating processes to improve efficiency. However, this can lead to a hidden crisis within organizations. As teams build layers of automation, they often end up with increased complexity rather than simplification. When original authors move on, understanding the context behind these automations becomes challenging, leading to potential breakdowns. This cycle can result in managing two mountains of automation, requiring...
Source: The New Stack
Darin Zook

I tested Cursor’s new Jira integration and it’s 5 stars, no notes. Here’s why.

2026-05-31 14:00
🚀 Cursor has launched its new Jira integration, designed to simplify task management by allowing users to assign tickets directly. I tested its functionality with various ticket types and found it performed well, particularly with clearly written requests. However, vague tickets posed challenges. The integration is not available to free users, but setting it up was straightforward. Overall, the integration worked effectively, showcasing its potential for developers. #Cursor #JiraIntegration...
Source: The New Stack
Jessica Wachtel

Why GPT-5.4, Claude, and Gemini can’t agree on basic, real-world facts

2026-05-30 13:11
AI models like GPT-5.4, Claude Opus 4.7, and Gemini 3 Pro are showing notable discrepancies in real-world fact-checking. 🤖 An analysis from Lenz revealed that these models disagreed on 67% of 1,000 claims, indicating a lack of consensus on basic facts. This split reflects different inference methods used by AI systems. The research, led by Kosta Jordanov, utilized real claims fact-checked by users since February 2026. #AI #MachineLearning #FactChecking #Lenz #Technology
Source: The New Stack
Adrian Bridgwater

Replit’s vibe coding platform just got a Visa-backed identity layer for AI agents — and it changes how agents spend money

2026-05-30 12:47
Replit is enhancing its platform by partnering with Visa to integrate payment infrastructure directly into its coding environment. 💻💳 This collaboration will provide developers with essential payment tools, such as tokenization and wallet management, within their workflows. Visa's investment aims to make commerce seamless in application development. A key feature is the Trusted Agent Protocol, which will help verify AI agents’ identities in real time, ensuring secure transactions. 🔒🤖 #Replit...
Source: The New Stack
Darryl K. Taft

Opus 4.8 Made Claude Smarter. Token Discipline Got Urgent.

2026-05-30 10:27
🚀 Opus 4.8 has launched, enhancing Claude's capabilities while raising concerns about potential overspending. A viral claim suggested a client spent $500 million on Claude in a month due to a lack of usage limits. This highlights the urgent need for 'token discipline' in AI usage. As AI becomes smarter, so does the importance of managing costs effectively. #AI #Opus4 #TokenDiscipline #Claude #TechNews
Source: The New Stack
Matthew Burns

Why Linux creator Linus Torvalds gets angry hearing “99% of code is AI”

2026-05-29 14:34
Linus Torvalds addressed the Open Source Summit North America, expressing his concerns about claims that "99% of code is AI." He emphasized that AI should be viewed as a productivity tool, not a replacement for human programmers. Understanding code and system architecture remains essential for long-term projects. Torvalds compared AI's impact on programming to the evolution of compilers, asserting that while AI enhances productivity, it doesn't replace the creativity and understanding...
Source: The New Stack
B. Cameron Gain

“The AI did it” won’t save you when EU regulators come knocking

2026-05-29 14:00
The EU's Cyber Resilience Act (CRA) is set to revolutionize accountability in software development. With key compliance deadlines approaching, organizations must prepare for new regulations aimed at protecting consumers from cyber threats. 🗓️ Two dates to note: reporting obligations for exploited vulnerabilities begin on Sept 11, 2026, and major obligations for developers kick in on Dec 11, 2027. The CRA applies to nearly all connected products, making no distinction between human-written...
Source: The New Stack
Luis Villa

Vendor neutrality isn’t magic: A hard look at the OpenTelemetry ecosystem

2026-05-29 14:00
The OpenTelemetry (OTel) ecosystem offers a standard data format for telemetry data, aiming for vendor neutrality. This concept raises questions about its practicality. Key discussions revolve around how the community’s focus on distributed tracing led to this vendor-neutral approach. The article explores the design of the OTel standard and where its neutrality may fall short. As we look to the future, the potential for achieving complete vendor neutrality in the OpenTelemetry ecosystem...
Source: The New Stack
Adriana Villela

The fix for soaring AI cloud bills exists — so why won’t we trust it?

2026-05-29 13:00
🌐 Yasmin Rajabi, COO of CloudBolt, highlights a key issue in automation trust. While organizations embrace productivity through automation, there's hesitation in right-sizing AI cloud resources. 🔍 A recent report shows 89% of organizations prioritize right-sizing amid soaring cloud costs, yet 71% of engineers still prefer human oversight for optimization. 📅 Join The New Stack on June 24 to discuss building trust in automation for AI workloads in Kubernetes. #CloudComputing #Automation...
Source: The New Stack
Jennifer Riggins

AI is shipping code faster than security was built to handle

2026-05-29 12:00
Snyk has launched Evo Continuous Offensive Security (COS) to enhance AI-powered penetration testing. This new product aims to help companies identify and fix vulnerabilities more efficiently. Traditional pentesting often leaves a significant gap, averaging only 15 days of coverage annually. This allows attackers to exploit applications for extended periods. The demand for AI pentesting is rising, driven by rapid code deployment and the complexity of new vulnerabilities. Analysts highlight the...
Source: The New Stack
Darryl K. Taft

Why AWS scrapped OpenSearch’s architecture to chase agent workloads

2026-05-28 18:30
AWS has launched a major rebuild of its OpenSearch Serverless to better cater to agent workloads. This overhaul aims to reduce costs by up to 60% when compared to traditional clusters. The new architecture separates storage and compute, allowing resources to scale to zero when idle. This change addresses the bursty usage patterns typical of AI agents. Additionally, the service now auto-scales 20 times faster. Key features include support for search and vector collections at launch, as well as...
Source: The New Stack
Frederic Lardinois

Claude Opus 4.8 is here: effort controls, dynamic workflows, cheaper fast mode, better honesty, less deception

2026-05-28 18:08
🚀 Anthropic has launched Claude Opus 4.8, enhancing user control with new effort settings. Users can now adjust Claude's performance for various tasks, allowing for faster or more in-depth responses. The model can also handle larger coding projects through dynamic workflows. Additionally, the fast mode is now three times cheaper, and the model boasts improved honesty and support for user autonomy. #ClaudeOpus #AIInnovation #Anthropic #TechUpdate #AI
Source: The New Stack
Meredith Shubel

Percona celebrates 20th birthday with new foundation — and a goat cake

2026-05-28 17:14
🎉 Percona recently celebrated its 20th birthday with a rebrand and the launch of the OurSQL Foundation, designed to support the MySQL community. The event featured a unique goat cake, symbolizing the greatness of open-source databases. 🐐 The foundation aims to foster collaboration and independence in the MySQL ecosystem, ensuring it remains free from proprietary influences. Co-founder Vadim Tkachenko emphasized the goal of growing MySQL independently while maintaining a relationship with...
Source: The New Stack
Chris J. Preimesberger

Why OpenAI and Anthropic are hiring forward deployed engineer teams

2026-05-28 16:35
OpenAI and Anthropic are expanding their forward deployed engineering (FDE) teams to address AI integration challenges. While powerful models are essential, successful deployment requires collaboration with clients to navigate existing systems and workflows. A study found that 95% of AI projects had minimal impact due to implementation issues, not model quality. FDE teams work closely with companies, accelerating deployment and improving system reliability. Job postings for these roles surged...
Source: The New Stack
Oluwadamilola Oshungboye

Claw-style AI agents are coming to the enterprise. The governance infrastructure is still catching up.

2026-05-28 13:00
🚀 Automation Anywhere has announced its new "claw-style" AI agents with EnterpriseClaw, designed for enterprise use. These agents can autonomously access file systems, create tools at runtime, and interact with applications. Key partnerships with Nvidia, Okta, and OpenAI enhance security and functionality, including the use of GPT 5.5. However, there’s a notable gap between AI capabilities and enterprise governance. The potential for broad access raises concerns in sensitive environments like...
Source: The New Stack
Darryl K. Taft

The agentic identity crisis: Why your security isn’t ready for the AI revolution

2026-05-28 12:00
The article discusses the shift from traditional web applications to AI-driven agentic ecosystems. This transition presents new security challenges as AI agents can perform actions, leading to vulnerabilities like the Action-Based Threat Model and the RAG Attack Surface. Currently, agents operate in an Identity Vacuum, creating risks of unauthorized access and permission issues. As AI agents outnumber humans, addressing these security gaps is crucial. 🔐🤖💻 #CyberSecurity #AI...
Source: The New Stack
Justin Dolly

Debugging the undebuggable: building observability into probabilistic AI systems

2026-05-28 11:00
Debugging AI systems presents new challenges compared to traditional methods. The article highlights that failures in AI are often non-deterministic, making it hard to pinpoint issues. To address this, the tutorial emphasizes building observability into AI architectures. By focusing on instrumenting AI services, engineers can trace decision-making processes and better understand unexpected behaviors. This shift from log-based thinking to observability-driven engineering is essential for...
Source: The New Stack
Oladimeji Sowole
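
The idea of instrumenting AI services so each decision step can be traced, as described in the item above, might look roughly like the sketch below. It is not the article's implementation: the `traced` decorator, the in-memory `SPANS` list, and the `retrieve` and `generate` stand-ins are all hypothetical names, and a production system would export spans to a tracing backend rather than a Python list.

```python
import functools
import time
import uuid

# In-memory span store (assumption); a real system would export these.
SPANS = []


def traced(step_name):
    """Decorator that records inputs, output, and latency for one step."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            span = {
                "id": uuid.uuid4().hex,
                "step": step_name,
                "inputs": {"args": args, "kwargs": kwargs},
            }
            start = time.perf_counter()
            try:
                result = func(*args, **kwargs)
                span["output"] = result
                return result
            except Exception as exc:
                # Keep the failure on the span, then re-raise unchanged.
                span["error"] = repr(exc)
                raise
            finally:
                span["duration_s"] = time.perf_counter() - start
                SPANS.append(span)
        return wrapper
    return decorator


@traced("retrieve")
def retrieve(query):
    # Hypothetical stand-in for a retrieval call.
    return [f"doc about {query}"]


@traced("generate")
def generate(query, docs):
    # Hypothetical stand-in for a model call.
    return f"answer to {query!r} using {len(docs)} document(s)"
```

Calling `generate("otel", retrieve("otel"))` leaves two spans in `SPANS`, one per step, so an unexpected answer can be traced back to the exact inputs each stage saw rather than reconstructed from scattered log lines.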

Snowflake commits $6B to AWS as it pushes deeper into AI

2026-05-27 20:10
🚀 Snowflake is investing $6 billion in a five-year deal with AWS to enhance its AI capabilities. This commitment focuses on using AWS’s Graviton processors and GPU-accelerated EC2 instances for AI model training. 🤝 The partnership also aims to bolster their joint presence in the AWS Marketplace, where Snowflake has seen over $7 billion in sales. 🌍 Additionally, Snowflake is expanding its AWS footprint with 10 new regions, addressing global data residency needs. #Snowflake #AWS #AI...
Source: The New Stack
Frederic Lardinois

Why MotherDuck refuses to fork DuckDB

2026-05-27 19:10
At the recent MCP Dev Summit in NYC, Till Döhmen of MotherDuck discussed the startup's approach to DuckDB. MotherDuck emphasizes collaboration with the DuckDB foundation, enhancing their offerings without forking the project. They run the largest fleet of DuckDB databases, providing valuable insights for the open-source community. Döhmen highlighted the benefits of leveraging DuckDB's extensibility while maintaining core connections. #MotherDuck #DuckDB #OpenSource #DataAnalytics...
Source: The New Stack
Nick Lucchesi

Researcher “gave Claude Code ‘ADHD’… and it thinks 2x better now.” Outside experts want more proof.

2026-05-27 18:29
🚀 This week, Udit Akhouri introduced a new Agent SDK tool on r/ClaudeCode, claiming to enhance Claude Code's performance by using a concept he calls "ADHD." ADHD, as described, enables coding agents to explore multiple thoughts and ideas simultaneously, scoring and refining them to improve decision-making. While the tool is gaining traction on GitHub, some experts question its novelty and effectiveness. Akhouri clarifies that ADHD is intended for brainstorming and planning rather than...
Source: The New Stack
Meredith Shubel

“There is no accountability”: AI coding agents are installing packages no one owns

2026-05-27 17:38
AI coding agents are changing software development, but as Willem Delbare from Aikido Security points out, "there is no accountability." This situation leaves companies vulnerable as AI installs packages without clear ownership of risk. 🛡️ Aikido's new solutions, like Aikido Endpoint, help monitor and block malware before installation, enhancing security while allowing developers flexibility. 🔍 The market is responding, with companies like Socket and Endor Labs also focusing on preventing...
Source: The New Stack
Darryl K. Taft

“Tokenmaxxing is real, expensive & it’s spreading”: New tools emerge to stop AI budgets from exploding

2026-05-27 17:27
New tools are emerging to combat the issue of tokenmaxxing, where AI token usage is misinterpreted as productivity. This practice can inflate budgets without clear outcomes. Uber's recent struggles highlight this concern, as their CTO revealed budget overruns linked to AI usage. The focus is shifting towards measuring actual results rather than mere consumption. Lanai's new "Token Tuner" aims to help organizations manage and reduce unnecessary token expenses. #AI #Tokenmaxxing #TechBudget...
Source: The New Stack
Adrian Bridgwater

With Google’s debut, the most important AI agent feature is now the most boring one

2026-05-27 15:18
Google recently announced the repositioning of Antigravity at the I/O conference, focusing on developing teams of autonomous AI agents. This trend is not new; it follows similar launches by Anthropic and AWS within a short span. Each company is addressing the same production challenges by simplifying the creation of managed agent runtimes. Markdown files, specifically AGENTS.md and SKILL.md, are emerging as the standard for defining these agents across platforms. #AI #GoogleIO #TechTrends...
Source: The New Stack
Janakiram MSV

Why AI agents need a Context Lake

2026-05-27 12:00
AI agents are powerful, but they often lack the context needed to utilize their tools effectively. Scaling these agents in organizations faces significant hurdles: security approvals can take months, and an overload of tools leads to confusion and inefficiency. Moreover, even when security and tool chaos are addressed, basic questions often go unanswered, limiting the effectiveness of AI agents. #AI #TechChallenges #ContextLake #Innovation #Productivity 👩‍💻🔍💡
Source: The New Stack
Monica White

Google ranks the best AI for building Android apps, and the winner isn’t Gemini

2026-05-26 17:32
Google has launched its Android Bench portal to help developers identify the best AI models for Android app development. This tool features a leaderboard that tracks performance, latency, tokens, and cost of various models. The latest update reveals that GPT 5.5 is currently the top AI model for building Android apps, surpassing Gemini 3.1 Pro and GPT 5.4. This initiative aims to enhance app quality across the Android ecosystem by providing clear benchmarks for AI performance....
Source: The New Stack
Adrian Bridgwater

Google pushes Pro, Ultra, and free users from open-source Gemini CLI to closed-source Antigravity CLI

2026-05-26 17:00
📢 Google has announced significant changes to its CLI offerings at Google I/O. Starting June 18, users of the open-source Gemini CLI, including Pro, Ultra, and free versions, will transition to the closed-source Antigravity CLI. This shift raises concerns as Antigravity currently offers fewer features and imposes usage limits. Google claims that Antigravity is designed to meet evolving developer needs, featuring a new terminal experience and multi-agent orchestration capabilities. However,...
Source: The New Stack
Meredith Shubel