Pixal3D generates high-fidelity 3D assets with PBR textures from a single image using pixel-aligned projection conditioning. It offers a three-stage cascade and low-VRAM mode for consumer GPUs.
SceneSmith uses GPT-5-powered agents to generate physically plausible 3D indoor scenes from text prompts, ready for robotics simulation without manual cleanup.
PAT3D composes a 9-stage pipeline combining LLMs, vision models, 3D asset generators, and physics simulation to produce physically plausible, simulation-ready 3D scenes from text prompts.