26 araç bulundu
Grounding Virtual Intelligence in Real Life.
Video Joint Embedding Predictive Architecture.
AI ile React/Next.js UI üretimi
Experience AI-driven tabletop adventures with dynamic storytelling and multiplayer interaction.. [Contact for Pricing]
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers.
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models.
Google DeepMind'ın video üretim modeli. Ses dahil sinematik video.
grade access to Gemini models, along with a full suite of MLOps tools for building and deploying custom AI solutions.
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
VILA: On Pre-training for Visual Language Models.
3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition.
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis.
Revolutionize e-commerce with AI-driven smart search and insights.. [Contact for Pricing]
Create realistic videos using just text.
Text-to-Image generation platform.
AI-driven image generator for creative professionals.. [Freemium]
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing.
Novel View Synthesis with Video Diffusion Models.
Multimodal Diffusion for Embodied Avatar Synthesis.
Vocode is an open-source library for building voice-based LLM applications.
Açık kaynak Cursor alternatifi — tam veri kontrolü ile AI kod editörü
Void is an open source Cursor alternative. Write code with the best AI tools, retain full control over your data, and access powerful AI features.
Crafting Ready-to-Use 3D Models with AI.
Video-to-Audio Generation with Hidden Alignment.