News — aggregated AI coverage from 30+ publications

  1. 4651. AI Agents Are Here. What Now? (huggingface.co) huggingface.co · 1 year ago | discuss
  2. 4652. CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard (huggingface.co) huggingface.co · 1 year ago | discuss
  3. 4653. Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo (huggingface.co) huggingface.co · 1 year ago | discuss
  4. 4654. FACTS Grounding: A new benchmark for evaluating the factuality of large language models (deepmind.google) deepmind.google · 1 year ago | discuss
  5. 4655. OpenAI o1 and new tools for developers (openai.com) openai.com · 1 year ago | discuss
  6. 4656. Court case: Musk v. OpenAI regarding for-profit structure (openai.com) openai.com · 1 year ago | discuss
  7. 4657. Sora: Video generation model now available (openai.com) openai.com · 1 year ago | discuss
  8. 4658. Sora System Card (openai.com) openai.com · 1 year ago | discuss
  9. 4659. [OpenAI] o1 System Card: Safety evaluation and red teaming report (openai.com) openai.com · 1 year ago | discuss
  10. 4660. How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs (huggingface.co) huggingface.co · 1 year ago | discuss
  11. 4661. Morgan Stanley's use of AI in financial services evaluation (openai.com) openai.com · 1 year ago | discuss
  12. 4662. Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard (huggingface.co) huggingface.co · 1 year ago | discuss
  13. 4663. Investing in Performance: Fine-tune small models with LLM insights - a CFM case study (huggingface.co) huggingface.co · 1 year ago | discuss
  14. 4664. Open Source Developers Guide to the EU AI Act (huggingface.co) huggingface.co · 1 year ago | discuss
  15. 4665. Advancing red teaming with people and AI (openai.com) openai.com · 1 year ago | discuss
  16. 4666. Introducing the Open Leaderboard for Japanese LLMs! (huggingface.co) huggingface.co · 1 year ago | discuss
  17. 4667. Letting Large Models Debate: The First Multilingual LLM Debate Competition (huggingface.co) huggingface.co · 1 year ago | discuss
  18. 4668. Judge Arena: Benchmarking LLMs as Evaluators (huggingface.co) huggingface.co · 1 year ago | discuss
  19. 4669. Share your open ML datasets on Hugging Face Hub! (huggingface.co) huggingface.co · 1 year ago | discuss
  20. 4670. [NTIA] OpenAI comments on data center growth, resilience, and security (openai.com) openai.com · 1 year ago | discuss
  21. 4671. Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge (huggingface.co) huggingface.co · 1 year ago | discuss
  22. 4672. Introducing HUGS - Scale your AI with Open Models (huggingface.co) huggingface.co · 1 year ago | discuss
  23. 4673. Hugging Face Teams Up with Protect AI: Enhancing Model Security for the ML Community (huggingface.co) huggingface.co · 1 year ago | discuss
  24. 4674. Scaling AI-based Data Processing with Hugging Face + Dask (huggingface.co) huggingface.co · 1 year ago | discuss
  25. 4675. OpenAI and Hearst Content Partnership (openai.com) openai.com · 1 year ago | discuss
  26. 4676. Introducing the Open FinLLM Leaderboard (huggingface.co) huggingface.co · 1 year ago | discuss
  27. 4677. A Short Summary of Chinese AI Global Expansion (huggingface.co) huggingface.co · 1 year ago | discuss
  28. 4678. New funding to scale the benefits of AI (openai.com) openai.com · 1 year ago | discuss
  29. 4679. 🇨🇿 BenCzechMark - Can your LLM Understand Czech? (huggingface.co) huggingface.co · 1 year ago | discuss
  30. 4680. Exploring the Daily Papers Page on Hugging Face (huggingface.co) huggingface.co · 1 year ago | discuss