Multimodal perception model
Perception-1
Perception-1 is Snapy.ai's media understanding layer. It helps interpret speech, pacing, scene context, and audio-visual structure so editing workflows can make stronger decisions.
- Multimodal video and audio understanding
- Scene, speech, and pacing awareness
- Supports clip selection and workflow orchestration