Description
Sora (OpenAI)
¿Qué es Sora?
Versiones y Modelos
Sora 2 (Septiembre 2025)
- Synchronized audio: Diálogos, sound effects, ambient noise
- Enhanced physics: Basketball rebota realísticamente, objetos persisten
- Advanced world simulation: Modela físicamente el mundo mejor
- Improved controllability: Sigue instrucciones intrincadas multi-shot
- Realistic styles: Cinematic, anime, realistic rendering
- Audio generation: Diálogos sincronizados con lip movements
- Higher quality experimental model
- Solo para ChatGPT Pro ($200/mes)
- Mejor resolución y duración
- Unlimited relaxed generations (después de 500 priority)
Sora 1.0 Turbo (Diciembre 2024)
- Mucho más rápido que preview feb 2024
- Still limited by physics/complexity
- Disponible vía API
- Mantiene acceso para usuarios existentes
Sora Original (Preview Febrero 2024)
- Demos iniciales "jaw-dropping"
- Limited red team access
- GPT-1 moment for video
- Object permanence emergió
Características Principales
Text-to-Video
- Genera videos desde descripciones texto
- Múltiples estilos: cinematic, realistic, anime
- Vertical short videos (optimizado social media)
- Duración: hasta ~20-30 segundos (no oficial)
Image-to-Video
- Anima imágenes estáticas
- Mantiene visual consistency
- Natural motion sequences
- Concept art to motion
Synchronized Audio
- Dialogue: Voces sincronizadas con lip movements
- Sound effects: Aligned con acción on-screen
- Ambient noise: Background soundscapes realistas
- High degree of realism: Audio coherente
Advanced Physics Simulation
- Basketball rebota off backboard si miss
- Models failure, not just success
- Object permanence mejorado
- Realistic motion y interactions
Multi-Shot Consistency
- Sigue instrucciones spanning multiple shots
- Persiste world state accurately
- Better continuity vs Sora 1
- Limitación: Long-form storytelling todavía challenging
Cameos Feature
- Upload yourself/others into AI videos
- Consent-based: solo tú decides quién usa tu likeness
- Revoke access anytime
- View all videos con tu character
- Works for humans, animals, objects
Controllability
- Intricate instructions multi-shot
- Camera movements controllables
- Style adjustments
- Scene composition control
- Limitación: Prompt adherence no perfecta
Social Features (Sora App)
- Feed-like functionality
- Share AI-generated videos
- Community platform
- "SlopTok" nickname por algunos usuarios
- Parental controls available
Safety & Provenance
- Visible watermark: Moving digital watermark (aunque removible por 3rd-party tools)
- C2PA Content Credentials: Embedded provenance
- Multi-modal moderation: Input prompts, output frames, audio, scenes
- Stricter teen limits: Daily generation caps
- Character consent: Explicit permission required
Pricing
ChatGPT Plus ($20/mes)
- 50 videos/mes at 480p
- O fewer videos at 720p
- Sora 2 incluido sin costo adicional
- Priority access sobre free tier
- Unlimited en sentido de no hard cap, subject to moderation
ChatGPT Pro ($200/mes)
- 500 priority videos/mes
- Sora 2 Pro model access (higher quality)
- Unlimited relaxed generations después de 500 priority
- Higher resolutions
- Longer durations
- Skip waitlist (invites)
- 10x more usage vs Plus
Free Tier (Sora 2)
- Invite-only inicialmente
- Generous limits pero compute-constrained
- Disponible en US/Canada primero
- iOS app (Android pendiente)
- Web access en sora.com después de invite
- Future: OpenAI planea option to pay por extra videos
API Pricing (Planned)
- Sora 1.0 Turbo: Ya disponible en API
- Sora 2 API: Planned, timeline TBD
- Unofficial providers: $0.10-0.50/segundo (oficial) vs $0.015-0.10 (3rd-party)
- ~$1-5 por 10-second video (oficial)
Limitaciones y Controversias
Limitaciones Técnicas
❌ Complex actions: Struggles con acciones complejas long duration
❌ Long-form consistency: Narrativas largas multi-shot difíciles
❌ Prompt adherence: "More controllable" ≠ perfect
❌ Specs no públicos: Duration/resolution/fps no documentados oficialmente
❌ Compute intensive: "Much, much more expensive" que texto/imagen
Controversias
- Usa copyrighted material by default unless opt-out
- Disney deal $1B (dic 2025): 200+ characters licenciados
- Japan's Content Overseas: demanda stop (Ghibli, Square Enix)
- MPA criticized approach (oct 2025)
- "Granular control" prometido para copyright holders
- 3rd-party tools removieron watermark 7 días después launch
- Undermines safety measures
- Nov 2024: API key leaked by testers
- Manifesto: protesta "art washing"
- OpenAI revoked access 3 horas después
- Hank Green y otros: app es AI slop
- Wired: overly similar to TikTok
- Concerns: misinformation, disinformation, scams
Restricciones de Acceso
❌ No Android: Early phase
❌ Age 18+: No disponible menores
❌ Geo-restricted: No UK, Switzerland, EEA
❌ No Team/Enterprise/Edu: Solo Plus/Pro/Business
Safety Restrictions
❌ People uploads limited: Deepfake mitigations
❌ Cameos: Explicit consent required
❌ Refusals: Multi-stage safety checks pueden rechazar
Casos de Uso
- TikTok, Reels, YouTube Shorts
- Vertical short-form content
- Viral creative videos
- Community sharing
- Concept reels
- Stylized shorts
- Pre-visualization
- Mood boards
- Product teasers
- Brand snippets
- Campaign visuals
- Explainer videos
- Rapid concepting
- Storyboarding
- Visual prototypes
- Pre-vis workflows
- Lesson visuals
- Educational explainers
- Tutorial content
- Illustrative reports
- Short films
- Creative animations
- Character-driven bits
- Dialogue-led content
- Client presentations
- Pitch decks
- Concept testing
- Iterative animation
Ventajas
✅ Synchronized audio: Único con diálogos + sound effects nativos
✅ Advanced physics: Mejor world simulation que competidores
✅ ChatGPT integration: Ecosystem único
✅ Cameos: Upload yourself/friends con consent
✅ Multi-shot control: Persist world state across shots
✅ Social app: Built-in distribution platform
✅ Safety-first: Provenance, watermarks, moderation
✅ Pro unlimited: Relaxed generations ilimitadas (Pro plan)
✅ API coming: Developer access planned
Comparación vs Competidores
- Sora 2: Mejor audio sync, OpenAI ecosystem
- Runway: More editing tools, established
- Sora 2: Better controllability
- Veo 3: Polished lip-sync, integrated audio
- Sora 2: Superior physics, audio, realism
- Pika: More accessible, user-friendly, no waitlist
- Sora 2: Audio generation, multi-shot
- Luma: Human motion quality en certain domains
Empresa
Founded: 2015
Sora Launch: Preview feb 2024 → Public dec 2024 → Sora 2 sep 2025
Access: ChatGPT Plus/Pro/Business
Regions: US, Canada (expanding)
Platforms: iOS app, Web (sora.com), API (planned)
- Diffusion transformer architecture
- Adaptation de DALL-E 3 tech
- Denoising latent diffusion model
- Transformer denoiser
- 3D patches en latent space
Key Features
Use Cases
Social media: TikTok, Reels, YouTube Shorts
Concept reels y stylized shorts
Product teasers y brand campaigns
Pre-visualization para film/video
Storyboarding y rapid concepting
Educational explainers
Tutorial content y lesson visuals
Short films con dialogue
Creative animations character-driven
Marketing explainer videos
Client presentations
Pitch decks con visual prototypes
Mood boards y concept testing
Viral creative content
Community video sharing
Iterative animation workflows
Visual storytelling
Brand snippets
Dialogue-led bits
Cameo-based content creation
Reviews de Usuarios
IAs Relacionadas

Google Gemini
Google DeepMind
Suite de modelos de IA multimodal de Google DeepMind con capacidades de texto, imagen, audio, video y código, integrada en el ecosistema de Google con agentes autónomos y razonamiento avanzado.

Midjourney
Midjourney Inc.
Generador de imágenes con IA líder en calidad artística que transforma prompts de texto en obras visuales impresionantes, con modelo V7, generación de video V1 y comunidad de 21M+ usuarios.

Stable Diffusion
Stability AI
Modelo open-source de generación de imágenes con IA de Stability AI. Incluye SD 3.5 con 8.1B parámetros, ejecutable localmente en hardware de consumo, con más de 10,000 modelos fine-tuned y licencia gratuita para uso comercial.
