C
ChaoBro

Open Source

AI open-source projects worth tracking and trying

Open Source

LMSYS P2P Weight Transfer: Second-Level Sync for 1T Parameter RL Training

LMSYS team introduces RDMA-based P2P weight update mechanism in SGLang as a supplement to traditional NCCL broadcast. Weight synchronization for trillion-parameter RL training compressed from minutes to seconds, compatible with all major open-source models.

#LMSYS #SGLang #RL Training
Open Source

OpenUI: An Open Standard for Generative UI

thesysdev/openui proposes an open standard protocol for generative UI. 4.5K stars, 705 stars/week. AI-generated UI is no longer driven by prompt guessing but by structured protocols. The frontend development workflow is being rewritten.

#OpenUI #Generative UI #Frontend
Open Source

fal genmedia CLI: Generate Images, Video, 3D, and Audio from the Terminal

fal.ai launched genmedia CLI to generate images, video, 3D, and audio directly from the terminal. Native terminal workflow, no Dashboard needed, directly integrable into Claude Code, Cursor and other AI Agent scripts and pipelines. 33K+ views and 190 bookmarks on day one.

#fal.ai #CLI #Generative Media
Open Source

Hugging Face Built a 110M Parameter DeepSeek-V4 Micro Clone

Hugging Face released nanowhale, a 110M parameter model replicating DeepSeek-V4 core architectures including MLA, MoE, and MTP. Benchmarks are not the point; its value is enabling low-cost architecture research.

#Hugging Face #DeepSeek #MoE
Open Source

MLX-VLM: Running Vision Language Models Locally on Mac

MLX-VLM, built on Apple MLX framework, enables Mac users to run and fine-tune vision language models locally. 51 new stars today, an important tool for edge multimodal inference.

#MLX #Vision Language Model #Apple Silicon
Open Source

Rapid-MLX: 4.2x Faster Than Ollama for Local AI on Mac, But Can It Replace It?

Rapid-MLX, built on Apple native MLX framework, delivers 2-4x faster inference than Ollama on Apple Silicon, hitting 160 tok/s on 4B models. Day-0 support for Qwen3.6 and DeepSeek V4 Flash, with OpenAI-compatible API. But ecosystem and model library remain weak points.

#Rapid-MLX #MLX #Local Inference
Open Source

Anthropic Donates Petri Alignment Tool to Meridian Labs

Anthropic donates open-source alignment evaluation tool Petri to Meridian Labs for independent operation, releasing v3.0. UK AI Security Institute already uses Petri to test every Claude model.

#Anthropic #Petri #Meridian Labs
Open Source

Sulphur-2 Open Source Release: Uncensored Video Generation Model with t2v + i2v Capabilities Breaks Commercial Monopoly

Sulphur-2 is now open source on Hugging Face, supporting text-to-video (t2v) and image-to-video (i2v) generation without content moderation. As the first truly usable open-source video generation model, it directly challenges commercial closed-source alternatives like SeedDance, Kling, and Veo in creative freedom.

#Sulphur-2 #Video Generation #Open Source Model
Open Source

Scrapling: 5,600 Stars in a Week — What Makes This Adaptive Scraping Framework Tick?

Adaptive web scraping framework Scrapling gained 5,650 stars this week on GitHub, surpassing 44K total. Its core selling point is automatic handling of anti-scraping mechanisms, dynamic pages, and structural changes. This article analyzes Scrapling's technical advantages, competitor comparison, and use cases.

#Scrapling #Web Scraping #Data Collection
Open Source

LLaDA2.0-Uni Goes Open Source: Diffusion LLM Unifies Multimodal Understanding and Generation, A New Paradigm of 8-Step Image Generation

Inclusion AI has open-sourced LLaDA2.0-Uni, a diffusion LLM-based unified multimodal model that integrates vision understanding and image generation into a single architecture. Powered by a MoE backbone and SigLIP-VQ tokenizer, it generates images in just 8 steps and supports native interleaved reasoning — offering a brand-new inference paradigm for multimodal Agents.

#LLaDA #Diffusion Model #Multimodal
Open Source

ds2api: Go-Based DeepSeek-Compatible Middleware, 1,726 New Stars This Week on GitHub

CJackHwang/ds2api is a Go-based DeepSeek-compatible middleware focused on high-concurrency protocol adaptation, converting diverse web protocols into standardized DeepSeek API format. This week it gained 1,726 stars reaching 3,066 total, offering a unified API gateway solution for enterprises integrating multiple model sources.

#DeepSeek #Middleware #API Compatibility
Open Source

TradingAgents: Multi-Agent LLM Financial Trading Framework with 58K+ Stars on GitHub

TauricResearch/TradingAgents continues trending on GitHub with 58,369 stars, gaining 2,023 stars today. The project builds a multi-agent LLM financial trading framework, decomposing analysis, decision-making, and risk control into independent collaborating agents. Running alongside the Agent Arena S3 live trading competition, this project provides an open-source reference implementation for building autonomous trading agents.

#TradingAgents #Multi-Agent #Financial Trading
Open Source

Google Gemini Embedding 2 Released: First Multimodal Unified Vector Space Model

Google releases Gemini Embedding 2, the industry's first fully multimodal embedding model based on Gemini architecture, supporting unified encoding of 100+ languages across text, image, and audio into a single vector space. Available via Gemini API and Vertex AI preview. This means semantic precision for text-to-image and image-to-image search receives a generational upgrade, and multimodal fusion for RAG knowledge bases becomes possible.

#Google #Gemini #Embedding
Open Source

Pixelle-Video: Open-Source AI Fully Automated Short Video Engine

AIDC-AI/Pixelle-Video is an open-source AI fully automated short video engine supporting one-stop generation from text script to finished video. 7600 stars, 1200+ forks, providing a locally deployable automation solution for short video creators.

#Video Generation #AI Content Creation #Automation