Articles from Source: Hugging-Face-Blog

Differential Transformer V2

2026-01-20 03:20
Introducing Differential Transformer V2 (DIFF V2) – an enhanced version of its predecessor, DIFF V1. This update emphasizes improved inference efficiency, making it a valuable tool for various applications. For more details, check the full article. πŸ“ˆπŸ” #DifferentialTransformer #AI #Efficiency #TechInnovation
Source: Hugging Face Blog

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

2026-01-20 00:00
πŸš€ Overworld has launched Waypoint-1, a new tool for real-time interactive video diffusion. This innovative technology allows users to engage with video content more dynamically, enhancing the overall viewing experience. Developers can explore the project's details and updates on GitHub. #Waypoint1 #VideoTechnology #Innovation #Overworld #RealTimeVideo
Source: Hugging Face Blog

Introducing OptiMind, a research model designed for optimization

2026-01-15 18:49
πŸš€ Introducing OptiMind, a new research model by Microsoft Research aimed at streamlining optimization workflows. This innovative tool translates natural language problem descriptions into formal mathematical models, reducing time and expertise needed for formulation. OptiMind enhances efficiency in optimization tasks. #Optimization #AI #MicrosoftResearch #MachineLearning #Innovation
Source: Hugging Face Blog

Open Responses: What you need to know

2026-01-15 00:00
🌟 OpenAI has introduced Open Responses, a new open inference standard aimed at enhancing AI agents. This initiative, supported by the open-source community and Hugging Face, addresses the limitations of the current Chat Completion format, which is not suited for autonomous systems. The goal is to collaborate with the community to establish a shared format that can effectively replace chat completions over time. Stay tuned for updates! πŸ”πŸ€– #OpenAI #OpenResponses #AIAgents #TechInnovation...
Source: Hugging Face Blog

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

2026-01-06 21:16
πŸ“Š New research highlights the Llama Nemotron RAG models, showcasing their potential to enhance accuracy in multimodal search and visual document retrieval. These advanced models demonstrate improved performance across various data types, making them a valuable asset for enterprises. Explore how these innovations can transform information retrieval! πŸ€–βœ¨ #TechInnovation #DataRetrieval #MultimodalSearch #AI #MachineLearning
Source: Hugging Face Blog

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

2026-01-05 22:56
πŸš€ NVIDIA has launched Cosmos Reason 2, enhancing reasoning capabilities for physical AI. This new model improves accuracy and leads the Physical AI Bench and Physical Reasoning rankings, making it the top choice for visual understanding. Stay updated with advancements in AI technology! πŸ€–βœ¨ #NVIDIA #AI #CosmosReason2 #Technology #Innovation
Source: Hugging Face Blog

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

2026-01-05 22:04
πŸš€ Exciting advancements in voice AI! A new model, NVIDIA Nemotron Speech ASR, enhances real-time voice interactions by addressing the speed vs. accuracy challenge. This system utilizes cache-aware technology to process only new audio, achieving up to 3x efficiency compared to traditional methods. The article highlights its real-world applications with Daily and Modal for improved performance in high-demand environments. #VoiceAI #AutomaticSpeechRecognition #NVIDIA #TechInnovation #Efficiency
Source: Hugging Face Blog

Introducing Falcon H1R 7B

2026-01-05 09:26
πŸš€ Introducing Falcon H1R 7B! Developed by the Technology Innovation Institute in Abu Dhabi, this decoder-only large language model showcases impressive reasoning capabilities despite its 7 billion parameters. It matches or exceeds the performance of larger models, thanks to a curated training set and an efficient fine-tuning process. Key focuses include speed, token efficiency, and accuracy. Explore more about this advancement in AI! #AI #LanguageModel #TechnologyInnovation #FalconH1R7B...
Source: Hugging Face Blog

Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture

2026-01-05 09:16
πŸš€ Exciting news in AI! Introducing Falcon-H1-Arabic, a new family of advanced Arabic language models. This release marks a significant advancement in architecture and capabilities for natural language processing. Developed through extensive research and community feedback, Falcon-H1-Arabic sets new standards in the field. Learn more in our official blog! 🌍✨ #AI #ArabicLanguage #NaturalLanguageProcessing #Innovation #TechNews
Source: Hugging Face Blog

NVIDIA brings agents to life with DGX Spark and Reachy Mini

2026-01-05 00:00
πŸš€ NVIDIA has unveiled exciting advancements at CES 2026, introducing new open models for AI agents. Key highlights include the NVIDIA Nemotron reasoning LLMs and the NVIDIA Isaac GR00T N1.6. These tools empower developers to create personalized AI agents. With the NVIDIA DGX Spark and Reachy Mini, you can now bring your own AI assistant to life right at your desk. This allows for private data processing and collaboration. For a detailed guide, check out the blog post. #NVIDIA #AI #CES2026...
Source: Hugging Face Blog

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

2025-12-23 14:07
Introducing AprielGuard, a new safety and security model for Large Language Models (LLMs). πŸ”’ This 8B parameter model addresses 16 categories of safety risks, including toxicity and misinformation, and detects various adversarial attacks like prompt injections and context hijacking. AprielGuard operates in both reasoning and non-reasoning modes, offering flexibility for different applications. #AI #LLM #Safety #Cybersecurity #AprielGuard
Source: Hugging Face Blog

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

2025-12-18 00:00
πŸš€ Exciting updates in Transformers v5! The redesign of tokenizers separates their design from trained vocabulary, allowing for easier inspection and customization. This version features clearer internals, a streamlined class hierarchy, and a unified backend. For those looking to understand or train model-specific tokenizers, this blog serves as a practical guide. #Transformers #Tokenization #AI #MachineLearning #DataScience
Source: Hugging Face Blog

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

2025-12-17 13:22
NVIDIA has introduced an open evaluation standard with the Nemotron 3 Nano model. This initiative aims to enhance transparency in AI model assessments. By sharing the complete evaluation recipe using the NeMo Evaluator library, NVIDIA allows for independent verification of results. This approach addresses concerns about the authenticity of model improvements. Open innovation is emphasized as crucial for AI advancement. Providing detailed evaluation information helps ensure accountability in...
Source: Hugging Face Blog

CUGA on Hugging Face: Democratizing Configurable AI Agents

2025-12-15 16:01
CUGA is now available on Hugging Face, aiming to simplify the development of configurable AI agents. πŸ€– This platform tackles common issues like brittleness and tool misuse, enabling the creation of robust agents that can adapt across various domains. CUGA offers a solution to the challenges developers face when building intelligent applications. πŸ“ˆ #AI #MachineLearning #HuggingFace #CUGA #TechNews
Source: Hugging Face Blog

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

2025-12-15 14:08
πŸš€ The Nemotron 3 Nano is setting a new benchmark for efficient and intelligent agentic models in 2026! This model addresses the challenges of balancing speed and accuracy. Smaller models may be quick but often lack depth, while larger ones are accurate yet costly. NVIDIA's solution features a hybrid Mamba-Transformer architecture, boasting a 1M-token context window to support high-throughput agents. #AI #NVIDIA #AgenticModels #TechInnovation #MachineLearning
Source: Hugging Face Blog

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

2025-12-05 00:00
πŸš€ Exciting news for developers! Introducing **swift-huggingface**, a new Swift package that serves as a complete client for the Hugging Face Hub. This package is available now and will soon integrate with swift-transformers. πŸ› οΈ The update addresses key issues from the community, including slow downloads, unreliable large file transfers, and a lack of shared cache with Python. Authentication methods have also been clarified for better usability. #HuggingFace #SwiftProgramming #AI...
Source: Hugging Face Blog

We Got Claude to Fine-Tune an Open Source LLM

2025-12-04 00:00
πŸš€ Exciting news for AI developers! Claude now has the ability to fine-tune open-source language models with the new Hugging Face Skills tool. This feature allows users to write training scripts, submit jobs to cloud GPUs, and monitor progress seamlessly. The hf-llm-trainer skill equips Claude with knowledge on model training, GPU selection, and configuration. Learn how to leverage this tool to enhance your AI projects! #ArtificialIntelligence #MachineLearning #OpenSource #HuggingFace #ClaudeAI
Source: Hugging Face Blog

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

2025-12-02 18:50
Explore how custom policy enforcement enhances AI applications. The article discusses the integration of reasoning mechanisms that improve both speed and safety in AI systems. Key insights include the importance of tailored policies to meet specific needs and the role of reasoning in decision-making processes. Stay informed about the future of AI! πŸ€–πŸ” #AI #PolicyEnforcement #TechInnovation #Safety #MachineLearning
Source: Hugging Face Blog

SARLO-80: Worldwide Slant SAR Language Optic Dataset at 80 cm Resolution

2025-12-01 10:10
🌍 Introducing SARLO-80, a new global dataset focused on slant synthetic aperture radar (SAR) imagery at 80 cm resolution. This dataset is designed to enhance machine learning applications in remote sensing. It provides a comprehensive resource for researchers and developers in the field. πŸ“Š Explore how SARLO-80 can advance your projects! #SARData #RemoteSensing #MachineLearning #DataScience #SARLO80
Source: Hugging Face Blog

Transformers v5: Simple model definitions powering the AI ecosystem

2025-12-01 00:00
πŸš€ Exciting news in the AI world! The release of Transformers v5.0.0rc-0 marks a significant milestone, evolving from just 40 model architectures to over 400. In just five years, daily installations have soared from 20,000 to over 3 million! πŸ“ˆ The community has also contributed more than 750,000 model checkpoints, enhancing collaboration within the ecosystem. Key focuses for v5 include simplicity, training, inference, and production efficiency. The library continues to adapt, ensuring its...
Source: Hugging Face Blog

Continuous batching from first principles

2025-11-25 00:00
πŸš€ Continuous batching is a key method for improving AI chatbot performance. It enhances throughput by processing multiple conversations simultaneously, which is crucial for high-demand scenarios. Starting from attention mechanisms and KV caching, this optimization allows for faster token generation, making user interactions smoother. Learn more about how these techniques work together to optimize AI responses. #AI #Chatbots #ContinuousBatching #MachineLearning #TechInsights
Source: Hugging Face Blog

Diffusers welcomes FLUX-2

2025-11-25 00:00
🌟 Exciting news in the world of image generation! Black Forest Labs has introduced FLUX.2, a new series of image generation models that features a fresh architecture and has been pre-trained from scratch. This model is distinct from its predecessor, FLUX.1, and is not intended as a direct replacement. The article discusses key updates, how to perform inference with FLUX.2, and the process of LoRA fine-tuning. Stay tuned for more advancements! #ImageGeneration #AI #MachineLearning #FLUX2...
Source: Hugging Face Blog

20x Faster TRL Fine-tuning with RapidFire AI

2025-11-21 00:00
πŸš€ Exciting news for TRL users! Hugging Face TRL now integrates with RapidFire AI, allowing for 20x faster fine-tuning and post-training experiments. This integration helps streamline the process of comparing multiple configurations without extensive code changes or increased GPU demands. πŸ” With RapidFire AI, teams can run various TRL configurations concurrently, even on a single GPU. This adaptive scheduling enhances efficiency and can improve evaluation metrics significantly. #AI...
Source: Hugging Face Blog

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

2025-11-21 00:00
The Open ASR Leaderboard has recently updated to include new multilingual and long-form transcription tracks. πŸ“Š Currently, there are 150 Audio-Text-to-Text and 27K ASR models available. Many benchmarks focus on short-form English, often missing key areas like multilingual performance and model throughput for long-form audio. The leaderboard has become a standard for comparing model accuracy and efficiency. πŸ” #ASR #MachineLearning #AI #TechUpdates #DataScience
Source: Hugging Face Blog

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

2025-11-20 00:00
πŸš€ Exciting news for Apple developers! Introducing AnyLanguageModel, a new Swift package that simplifies the integration of local and remote LLMs. This tool offers a single API that replaces Apple’s Foundation Models framework, streamlining the process for AI-powered app development. Developers often face challenges with multiple APIs and integration patterns. AnyLanguageModel aims to reduce this friction, encouraging the use of local, open-source models. For more details, check the update on...
Source: Hugging Face Blog

Easily Build and Share ROCm Kernels with Hugging Face

2025-11-17 00:00
πŸš€ Build and share ROCm kernels easily with Hugging Face! Custom kernels enhance deep learning performance, allowing tailored GPU operations. However, compiling them can be challenging due to various complexities. Hugging Face's kernel-builder simplifies this process, supporting multiple backends like ROCm. This guide focuses on building efficient kernels for AMD GPUs, covering best practices for testing and deployment. #DeepLearning #ROCm #HuggingFace #GPU #TechInnovation
Source: Hugging Face Blog

Building for an Open Future - our new partnership with Google Cloud

2025-11-13 00:00
πŸš€ Exciting news! Hugging Face has announced a new partnership with Google Cloud to help companies build their own AI using open models. Jeff Boudier emphasizes the importance of this collaboration, stating it simplifies AI customization on Google Cloud. 🌐 With over 2 million open models available, this partnership aims to enhance accessibility and development for businesses worldwide. πŸ€– #AI #GoogleCloud #OpenModels #HuggingFace #Partnership
Source: Hugging Face Blog

Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac

2025-10-29 00:00
Discover how NVIDIA Isaac is transforming healthcare robotics! πŸ€– The article details the journey of developing a healthcare robot, starting from simulation to real-world deployment. It highlights the use of advanced AI and simulation tools to enhance the robot's capabilities in patient care. This innovation promises to improve efficiency in healthcare settings. #HealthcareInnovation #Robotics #NVIDIAIsaac #AI #Technology
Source: Hugging Face Blog

Voice Cloning with Consent

2025-10-28 00:00
🌐 Exciting advancements in voice technology are here! A recent blog post discusses the concept of a 'voice consent gate' aimed at promoting ethical voice cloning. This initiative seeks to ensure that voice cloning is performed with consent, helping to mitigate risks associated with misuse, like deepfakes. The article highlights both the advantages and concerns of voice generation, noting its potential to assist those who have lost their ability to speak and support language learning. For more...
Source: Hugging Face Blog

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

2025-10-27 00:00
πŸš€ Exciting news in open machine learning! Huggingface_hub has officially launched v1.0 after five years of development. This update supports over 200,000 libraries and provides access to 2 million models, 0.5 million datasets, and 1 million Spaces. Key changes include a new backend with httpx and an improved CLI interface. Upgrade now to enhance your experience! #HuggingFace #MachineLearning #OpenSource #AI #TechUpdate
Source: Hugging Face Blog

Streaming datasets: 100x More Efficient

2025-10-27 00:00
Discover the latest advancements in streaming datasets, reported to be 100 times more efficient than traditional methods. πŸ“Š This breakthrough can significantly enhance data processing and real-time analytics across various industries. For developers and data scientists, this opens new avenues for improved performance and resource management. Stay updated on tech innovations! πŸš€πŸ’» #DataScience #StreamingData #TechInnovation #Efficiency #Analytics
Source: Hugging Face Blog

LeRobot v0.4.0: Super Charging OSS Robotics Learning

2025-10-24 00:00
πŸš€ Exciting updates in LeRobot v0.4.0 enhance open-source robotics learning! This release introduces scalable Datasets v3.0 and powerful new VLA models, including PI0.5 and GR00T N1.5. A new plugin system simplifies hardware integration, while support for LIBERO and Meta-World simulations broadens capabilities. Key features also include simplified multi-GPU training and a new Hugging Face collaboration. #OpenSource #Robotics #AI #TechUpdates #LeRobot
Source: Hugging Face Blog

Building the Open Agent Ecosystem Together: Introducing OpenEnv

2025-10-23 00:00
🌐 Exciting news in the tech world! Meta and Hugging Face have announced the launch of the OpenEnv Hub, aimed at fostering a shared community for agentic environments. πŸ› οΈ This initiative highlights the importance of both compute infrastructure and the developer community in scaling AI. 🧩 Agentic environments provide the necessary tools and context for agents to perform tasks safely and effectively. #OpenEnv #AI #TechCommunity #Meta #HuggingFace
Source: Hugging Face Blog

Hugging Face and VirusTotal collaborate to strengthen AI security

2025-10-22 00:00
πŸš€ Exciting news in AI security! Hugging Face has teamed up with VirusTotal to enhance the safety of files on the Hugging Face Hub. πŸ” This partnership ensures that over 2.2 million public model and dataset repositories are continuously scanned for potential threats, protecting the machine learning community. πŸ›‘οΈ AI models can carry risks, from disguised malicious files to compromised assets. With VirusTotal’s trusted malware intelligence, users gain an additional layer of security. #AISecurity...
Source: Hugging Face Blog

Sentence Transformers is joining Hugging Face!

2025-10-22 00:00
πŸš€ Exciting news! Sentence Transformers is now part of Hugging Face! This transition from the UKP Lab at TU Darmstadt to Hugging Face marks a new chapter for the popular library, which has been maintained by Tom Aarsen since late 2023. With Hugging Face's robust infrastructure, Sentence Transformers will enhance its capabilities in Information Retrieval and NLP. πŸ’‘ Since 2019, it has offered over 16,000 models for tasks like semantic search and clustering, serving a million users monthly....
Source: Hugging Face Blog

Supercharge your OCR Pipelines with Open Models

2025-10-21 00:00
πŸš€ Supercharge your OCR pipelines with open models! The rise of vision-language models is changing the document AI landscape. This guide helps you navigate the choices, focusing on cost efficiency and privacy with open-weight models. Learn when to fine-tune models, key selection factors, and how to enhance OCR with multimodal retrieval and document QA. Get started today! πŸ“„πŸ” #OCR #DocumentAI #OpenModels #TechGuide #AIInsights
Source: Hugging Face Blog

Unlock the power of images with AI Sheets

2025-10-21 00:00
Unlock the power of images with AI Sheets! πŸ“Έβœ¨ This article discusses how to generate and transform text and images seamlessly. It provides a step-by-step guide for users to enhance their visual content effortlessly. Stay ahead in your creative projects by leveraging AI tools for better storytelling through images. #AISheets #ImageTransformation #TechInnovation #CreativeTools #VisualStorytelling
Source: Hugging Face Blog

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

2025-10-16 00:00
πŸš€ Google Cloud's new C4 Virtual Machine showcases a significant 70% improvement in Total Cost of Ownership (TCO) for OpenAI's GPT OSS, thanks to collaboration with Intel and Hugging Face. Key findings include a 1.7x TCO improvement over the previous C3 VM. The C4 VM also offers enhanced throughput and a lower hourly rate. #GoogleCloud #Intel #HuggingFace #AI #CloudComputing
Source: Hugging Face Blog

Get your VLM running in 3 simple steps on Intel CPUs

2025-10-15 00:00
Unlock the potential of Vision Language Models (VLMs) on Intel CPUs! 🌐 This article outlines three simple steps to set up a VLM locally. You can analyze images, generate captions, and answer visual content questionsβ€”all while keeping your data private and secure. πŸ”’ Tools like Optimum Intel and OpenVINO make this process accessible, even without high-end hardware. Check out the full guide for a seamless setup! πŸ–₯️ #AI #VisionLanguageModels #DataPrivacy #Intel #TechGuide
Source: Hugging Face Blog

SOTA OCR on-device with Core ML and dots.ocr

2025-10-02 00:00
πŸš€ Exciting advancements in on-device OCR technology have emerged with dots.ocr from RedNote, a model outperforming Gemini 2.5 Pro. This 3B parameter model is designed for seamless on-device performance, eliminating the need for API keys and internet access. Key to this is Apple's Neural Engine, which offers impressive power efficiency. πŸ”‹ However, converting models to Core ML can be challenging. Apple also provides MLX for GPU targeting, enhancing flexibility. Stay tuned for a three-part...
Source: Hugging Face Blog