HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
最近更新: 1天前Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.
最近更新: 3天前⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE fore...
最近更新: 5天前PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
最近更新: 5天前Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and ...
最近更新: 5天前VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
最近更新: 5天前A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis ...
最近更新: 5天前Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sharing.
最近更新: 5天前Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
最近更新: 5天前