World Models & Interactive Video

StreamDiffusion is a cutting-edge, real-time diffusion pipeline that is revolutionizing the creation of interactive and generative visual content. This open-source technology offers significant performance advantages over traditional diffusion-based systems, making it a powerful tool for artists, developers, and creators. When combined with platforms like TouchDesigner, StreamDiffusion opens up a new realm of possibilities for real-time video generation, interactive installations, and immersive experiences.
At its core, StreamDiffusion is designed for speed and efficiency. Its key technical features include:
While StreamDiffusion is a powerful Python-based tool, its integration into visual development platforms like TouchDesigner can be complex. To simplify this process, Lyell Hintz (aka @DotSimulate) created StreamDiffusionTD, a TouchDesigner operator that encapsulates all of StreamDiffusion's features into a single, user-friendly component.
StreamDiffusionTD connects real-time inputs, such as audio, sensors, and camera feeds, to the StreamDiffusion pipeline, allowing for the creation of live visuals that can be manipulated in real time. The operator is designed to be transparent, exposing core parameters to give users immediate feedback and control over the creative process.
A key feature of StreamDiffusionTD is its integration with the Daydream API, which enables remote GPU inference. This means users do not need a high-end local GPU to run StreamDiffusion, making the technology more accessible to a wider range of creators. The Daydream API integration also adds several advanced features, including:
To get started with StreamDiffusionTD, you will need the following:
Operating System: Windows 10 or 11
Graphics Card: An NVIDIA graphics card with CUDA support
Software:
The installation process involves downloading the StreamDiffusion repository, installing its dependencies, and optionally installing the TensorRT SDK for NVIDIA GPUs. Once the prerequisites are met, you can drag the StreamDiffusionTD.tox file into your TouchDesigner project to get started.
StreamDiffusionTD offers three main functionalities:
By adjusting the various parameters within the StreamDiffusionTD component, users can create a wide range of unique and dynamic visuals, from audio-reactive concert visuals to camera-driven generative art for large-scale displays. The operator can also be extended and customized using Python, MIDI, OSC, and other inputs supported by TouchDesigner.
World Models & Interactive Video

A live webcam-based Playground project that reinterprets human movement into flowing clouds

Instantly restyle your live video stream with reference images and creative prompts using the Playground.

How Daydream Live brought real-time generative AI to the Burning Man webcast, turning the Playa into a global, participatory canvas.