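Lynx generates personalized videos from a single image by attaching ID and Ref adapters to a frozen Diffusion Transformer. Because the backbone stays frozen and only the adapters carry the personalization, this modular design aims to balance identity fidelity against the cost of retraining the full model.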
StereoWorld uses binocular stereo vision cues to guide 3D-consistent stereo video generation, offering a biologically inspired approach to scene geometry understanding.
daVinci-MagiHuman uses a 15B-parameter single-stream transformer with a sandwich architecture to generate video and audio from text, achieving competitive quality and fast inference on a single H100 GPU.
heygen-com/skills offers modular AI agent skills to create digital avatars and videos from photos, integrating with multiple coding agents via an agent-driven install pattern.
LTX Video Generator for Mac runs AI video generation entirely on Apple Silicon by bridging a native SwiftUI front end with a Python subprocess. It handles large models, audio-video sync, and long-running tasks locally.
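
The subprocess bridge described for LTX Video Generator for Mac can be illustrated with a minimal sketch. The source does not say how the app and its Python worker communicate, so everything here is an assumption: the JSON-lines-over-stdout protocol, the event names, the worker script, and the run_worker helper are all hypothetical. A Python parent process also stands in for the SwiftUI host, which would more likely launch the worker through Foundation's Process and Pipe.

```python
import json
import subprocess
import sys

# Hypothetical worker: stands in for a long-running generation job.
# It reports progress as one JSON object per line on stdout (assumed protocol).
WORKER_SOURCE = r"""
import json
total = 4  # pretend the job has four steps
for step in range(1, total + 1):
    print(json.dumps({"event": "progress", "step": step, "total": total}), flush=True)
print(json.dumps({"event": "done", "output": "clip.mp4"}), flush=True)
"""


def run_worker(on_event):
    """Launch the worker subprocess and forward each JSON event to on_event.

    Returns the list of events received. Raises RuntimeError if the worker
    exits with a non-zero status.
    """
    events = []
    with subprocess.Popen(
        [sys.executable, "-c", WORKER_SOURCE],
        stdout=subprocess.PIPE,
        text=True,
    ) as proc:
        # Reading line by line lets the host update its UI while the job runs.
        for line in proc.stdout:
            line = line.strip()
            if not line:
                continue
            event = json.loads(line)
            events.append(event)
            on_event(event)
        return_code = proc.wait()
    if return_code != 0:
        raise RuntimeError(f"worker exited with code {return_code}")
    return events


if __name__ == "__main__":
    # Usage: print each event as it arrives, as a UI layer might.
    run_worker(lambda event: print("host received:", event))
```

Keeping the model code in a separate process means a crash or a long generation run in the worker does not block the interface, and a line-oriented protocol keeps the host side simple to parse.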