Articles from Source: Hugging-Face-Blog

Introducing RTEB: A New Standard for Retrieval Evaluation

2025-10-01 00:00
🚀 Introducing RTEB: A New Benchmark for Retrieval Evaluation! The Retrieval Embedding Benchmark (RTEB) is designed to accurately evaluate the retrieval accuracy of embedding models. It combines open and private datasets to address the limitations of existing benchmarks. This approach aims to ensure fair and transparent measurement of how models perform on unseen data, enhancing the quality of AI applications. Learn more about RTEB and its impact on retrieval evaluation! 📊🔍 #RTEB #AI...
Source: Hugging Face Blog

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

2025-09-29 00:00
🚀 Exciting advancements in AI with the Qwen3-8B model! Recent updates reveal a significant acceleration in performance on Intel® Core™ Ultra using Depth-Pruned Draft Models. By implementing speculative decoding, generation speeds have improved by approximately 1.4x. These enhancements enable the efficient operation of a fast, local AI agent. #AI #MachineLearning #Intel #Qwen3 #OpenVINO
Source: Hugging Face Blog

VibeGame: Exploring Vibe Coding Games

2025-09-29 00:00
🎮 Discover VibeGame, a new platform focused on coding games that enhance programming skills through interactive play. The initiative aims to make coding accessible and enjoyable for all levels, encouraging creativity and problem-solving. Explore how VibeGame is shaping the future of learning in tech! 🌐💡 #VibeGame #CodingGames #TechEducation #LearnToCode #GameDevelopment
Source: Hugging Face Blog

Swift Transformers Reaches 1.0 — and Looks to the Future

2025-09-26 00:00
🚀 Swift Transformers has reached version 1.0! Launched two years ago, this library aims to support Apple developers in integrating local LLMs into their applications. The team has gathered insights from the community and is now focusing on enhancing features, particularly for MLX and agentic use cases. Stay tuned for more updates! #SwiftTransformers #AppleDevelopers #MachineLearning #TechUpdate #Innovation
Source: Hugging Face Blog

Smol2Operator: Post-Training GUI Agents for Computer Use

2025-09-23 00:00
Introducing Smol2Operator: a vision-language model that learns GUI skills and evolves into an agentic GUI coder. The project shares training recipes, data-processing tools, and demo datasets to support reproducibility and further research. Check out the full collection on GitHub! 🖥️📊🤖 #AI #MachineLearning #Research #GitHub #TechInnovation
Source: Hugging Face Blog

Gaia2 and ARE: Empowering the community to study agents

2025-09-22 00:00
🌍 Introducing Gaia2 and the Meta Agents Research Environments (ARE)! This new framework aims to empower the community in evaluating AI agents. It offers a more realistic approach to testing agent behaviors in complex scenarios, moving beyond traditional evaluation environments. Gaia2 allows for deep analysis of agent capabilities, while ARE provides customizable conditions for studying behaviors. Both tools are now available for public use! #AIEvaluation #Gaia2 #MetaAgents...
Source: Hugging Face Blog

Scaleway on Hugging Face Inference Providers 🔥

2025-09-19 00:00
🚀 Scaleway is now an official Inference Provider on the Hugging Face Hub! This addition expands serverless inference capabilities, allowing users to access a variety of models seamlessly through client SDKs for JS and Python. Explore popular models like gpt-oss and Qwen3 directly from Scaleway's Hub page. Check them out at the link below! 🔗 https://huggingface.co/scaleway #AI #MachineLearning #HuggingFace #Scaleway #InferenceProviders
Source: Hugging Face Blog

Democratizing AI Safety with RiskRubric.ai

2025-09-18 00:00
🌐 Exciting news in AI safety! The launch of RiskRubric.ai aims to enhance trust in AI models by providing standardized risk assessments. With over 500,000 models on the Hugging Face hub, users often struggle to evaluate security and privacy aspects. This initiative, led by Cloud Security Alliance and Noma Security, seeks to ensure transparent security reporting as AI adoption grows. #AISafety #RiskAssessment #AIModels #CloudSecurity #Innovation
Source: Hugging Face Blog

Public AI on Hugging Face Inference Providers 🔥

2025-09-17 00:00
🚀 Exciting news! Public AI is now a supported Inference Provider on Hugging Face Hub. This integration enhances serverless inference capabilities and simplifies access to public models from institutions like the Swiss AI Initiative and AI Singapore. Explore trending models and learn more at: https://huggingface.co/publicai #PublicAI #HuggingFace #AIModels #MachineLearning
Source: Hugging Face Blog

`LeRobotDataset`: Bringing large-scale datasets to lerobot

2025-09-16 00:00
🚀 Exciting news from the world of robotics! The LeRobotDataset team has released version 3.0, improving how large-scale datasets are managed. This new version allows multiple episodes to be packed into a single file, overcoming previous file-system limitations. It also supports streaming mode for efficient data processing. A simple utility is available to convert existing datasets to this new format. Check it out and enhance your projects! #LeRobotDataset #Robotics #DataScience...
Source: Hugging Face Blog

Visible Watermarking with Gradio

2025-09-15 00:00
✨ Introducing Visible Watermarking with Gradio! Watermarking generative AI content is becoming essential as AI-generated images, videos, and audio grow increasingly lifelike. Distinguishing real from generated content is challenging, making watermarking a vital solution. Learn more about the importance of this approach and its implementation in the latest update. #Watermarking #AI #Gradio #ContentCreation #GenerativeAI
Source: Hugging Face Blog

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

2025-09-11 00:00
OpenAI has launched the GPT-OSS series of models, featuring advancements like MXFP4 quantization and efficient kernels. 🚀 The transformers library has been upgraded significantly to enhance model loading, running, and fine-tuning. This allows for better community understanding and implementation. Key updates include Zero-build Kernels, Tensor Parallelism, and more. Explore these enhancements and their impact on future models! 🔍💻 #OpenAI #GPTOSS #Transformers #MachineLearning #AIInnovations
Source: Hugging Face Blog

Jupyter Agents: training LLMs to reason with notebooks

2025-09-10 00:00
🚀 Jupyter Agents aim to enhance LLMs by enabling code execution directly in Jupyter Notebooks. This integration helps tackle complex data science tasks more efficiently. The initiative focuses on improving smaller models to compete with larger ones through high-quality training data and fine-tuning methods. Stay tuned for updates on this innovative project! 🧠💻 #Jupyter #LLM #DataScience #AI #MachineLearning
Source: Hugging Face Blog

mmBERT: ModernBERT goes Multilingual

2025-09-09 00:00
🌐 Exciting developments in AI! The article discusses mmBERT, a new multilingual model built on ModernBERT. It aims to enhance language processing across various languages. Key features include improved understanding and generation of text in multiple languages, making it a versatile tool for global applications. For more details, check out the full article! #AI #MachineLearning #NLP #mmBERT #Multilingual
Source: Hugging Face Blog

Welcome EmbeddingGemma, Google's new efficient embedding model

2025-09-04 00:00
🚀 Exciting news in AI! Google has introduced EmbeddingGemma, a new embedding model designed for efficiency. This model aims to enhance various applications by improving performance while reducing resource consumption. Developers can explore its capabilities on GitHub. Stay tuned for more updates on AI advancements! #Google #EmbeddingGemma #AI #MachineLearning #TechInnovation
Source: Hugging Face Blog

Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation

2025-09-02 00:00
Unlock the power of ZeroGPU Spaces with PyTorch ahead-of-time (AoT) compilation! 🚀 AoT improves performance by optimizing models once for faster reloading, making demos snappier. Users can expect speed boosts ranging from 1.3× to 1.8× on various models. Explore advanced techniques like FP8 quantization and dynamic shapes for an enhanced experience. Check out live demos on the zerogpu-aoti organization! #ZeroGPU #PyTorch #MachineLearning #AI #TechInnovation
Source: Hugging Face Blog

Generate Images with Claude and Hugging Face

2025-08-19 00:00
Discover how to generate images using Claude and Hugging Face! 🤖✨ The article outlines step-by-step methods for utilizing these tools effectively. It highlights key features and provides tips for best practices in image generation. Explore the potential of AI in creative projects! #AI #ImageGeneration #HuggingFace #Claude #TechTrends
Source: Hugging Face Blog

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

2025-08-18 00:00
🚀 Excited to enhance your GPU capabilities? Check out the guide on building production-ready CUDA kernels! It introduces the kernel-builder library, designed to simplify the development of custom kernels for various architectures. Learn how to create efficient and maintainable systems while overcoming deployment challenges. Perfect for those looking to elevate their models! #CUDA #GPUs #Programming #TechGuide #MachineLearning
Source: Hugging Face Blog

MCP for Research: How to Connect AI to Research Tools

2025-08-18 00:00
🔍 Academic research often requires navigating multiple platforms like arXiv and GitHub. The Model Context Protocol (MCP) offers a solution by enabling AI to communicate with these research tools. This allows for natural language requests, streamlining the discovery process and reducing the need for manual switching between sites. Explore how MCP can enhance your research experience! 📊🤖 #AIinResearch #ModelContextProtocol #ResearchInnovation #AcademicTools
Source: Hugging Face Blog

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

2025-08-12 00:00
Exploring the capabilities of LLMs in Filipino languages is essential. 🇵🇭 FilBench, a new evaluation suite, assesses LLM fluency, translation, and cultural knowledge in Tagalog and Cebuano. Despite high ChatGPT usage in the Philippines, systematic evaluations are lacking. The study evaluates over 20 LLMs, aiming to provide clear insights. 📄 Read the paper: [arxiv.org](https://arxiv.org/abs/2508.03523) 🖥️ Check GitHub: [github.com](https://github.com/filbench/filbench-eval) #FilBench #LLMs...
Source: Hugging Face Blog

TextQuests: How Good are LLMs at Text-Based Video Games?

2025-08-12 00:00
📚 The article explores the capabilities of Large Language Models (LLMs) in text-based video games. It highlights that while LLMs excel in static knowledge benchmarks, they face challenges in dynamic, interactive environments. Developing effective evaluation methods for these models remains crucial. Two main approaches are suggested: real-world environments with specific skills and simulated open-world settings. The article introduces TextQuests as a new benchmark to assess LLM performance in...
Source: Hugging Face Blog

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

2025-08-08 00:00
🚀 Training large models on multiple GPUs can be complex. The article "Accelerate ND-Parallel" discusses how to simplify this process using the Accelerate library and Axolotl. It provides a step-by-step guide to integrate various parallelism strategies in your training script, enhancing efficiency. Key configurations for parallelism include Fully Sharded Data Parallel and Data Parallel degrees. This approach requires at least 2 nodes with 8 GPUs each for optimal performance. #MultiGPU...
Source: Hugging Face Blog

Introducing AI Sheets: a tool to work with datasets using open AI models!

2025-08-08 00:00
🚀 Exciting news for data enthusiasts! Hugging Face has launched AI Sheets, an open-source tool designed to build, enrich, and transform datasets using AI models without any coding required. You can deploy it locally or access it directly on the Hub, utilizing thousands of models, including gpt-oss from OpenAI. Explore the tool for free here: [AI Sheets](https://huggingface.co/spaces/aisheets/sheets) or install it locally from GitHub! #AISheets #OpenSource #DataScience #MachineLearning...
Source: Hugging Face Blog

Vision Language Model Alignment in TRL ⚡️

2025-08-07 00:00
🔍 The article discusses the alignment of Vision Language Models (VLMs) in the context of Technology Readiness Levels (TRL). It highlights the importance of aligning VLMs with real-world applications to enhance their effectiveness. 💡 The piece outlines key strategies for achieving this alignment, focusing on practical implementation and evaluation methods. For those interested in AI development, this is a valuable read! #VisionLanguageModel #AIAlignment #TechnologyReadiness #MachineLearning...
Source: Hugging Face Blog

Welcome GPT OSS, the new open-source model family from OpenAI!

2025-08-05 00:00
🚀 Exciting news from OpenAI! They have launched GPT OSS, a new open-source model family featuring two versions: gpt-oss-120b with 117B parameters and gpt-oss-20b with 21B parameters. Both models utilize a mixture-of-experts design for efficient performance. These models are licensed under Apache 2.0, promoting safe and responsible use. OpenAI aims to enhance accessibility in AI through this release. #OpenAI #GPTOSS #MachineLearning #AICommunity #OpenSource
Source: Hugging Face Blog

Build an AI Shopping Assistant with Gradio MCP Servers

2025-07-31 00:00
🚀 Python developers can supercharge their LLMs with Gradio's Model Context Protocol (MCP)! Gradio simplifies the integration of AI models from Hugging Face, enabling LLMs to tackle real-world problems. Key features include automatic conversion of functions into LLM tools, real-time notifications, and seamless file uploads. Imagine an AI shopping assistant that finds clothes for you and even shows virtual try-ons! #AI #Python #Gradio #MachineLearning #Ecommerce
Source: Hugging Face Blog