
Rendered from selection context and anchored inline; long-press to save, regenerate, or remove.
Multimodal
Select a sentence in the novel or a reply in chat: illustration, narration and video generate in place and embed in place. Imagen and gpt-image paint the scene, TTS voices read it aloud, Veo sets it in motion — media grows inside the story instead of interrupting it.
A real interactive prototype: tap Image in the selection toolbar to watch Imagen render, Continue for streaming prose, Audio for the narration waveform. Every generation lands in the API request log with tokens and cost.
Luowen looked up as the old starport lamps went out one by one, as if someone had folded the night sky into her palm.
She finally understood the warning: every route is not only a path, but an echo.
"Do not light the seventh lamp," she whispered. "It is not a signal. Someone is asking the past for help."
The ship that vanished tonight would dock again in some other branch.
Select any sentence — continue, illustrate, narrate, or branch from right here.

Rendered from selection context and anchored inline; long-press to save, regenerate, or remove.
Narrate selections instantly or cache whole chapters offline — speed, sleep timer, paragraph skip.

Hand a signature moment to a video model and play the clip right inside the reading page.
FAQ
Imagen 3, gpt-image-1, SeedDream and more. Prompts are built from selection context, templates are editable, and size/style can be adjusted before generating.
Your own TTS providers: Qwen TTS, Grok Voice, MiMo Speech and others, with selectable voices; cached chapters play offline.
Model-dependent — typically tens of seconds to minutes. Jobs run in the background and the clip anchors back into the text when ready.
BYOK means your provider's price with no markup; tokens, cost and cache hits are recorded per request in the API log.
Reserve a spot
We send one short note when the next tester wave opens. You can also email [email protected].