Open-AI Model
We implemented an advanced input parsing mechanism, converting PDFs or text into a clean, syntax-error-free format suitable for processing by the OpenAI model.API
Utilizing the chat completion API endpoint of OpenAI GPT-4, we ensured coherent and engaging content by generating a structured script based on the user's input.Enhance Visual Appeal
To enhance the visual appeal of the videos, we segmented the script into manageable parts for image creation, with each segment fed into DALL-E 3 to produce corresponding visuals.High Quality Audio
Integration of Eleven Labs' technology enabled us to synthesize narration using selected voice artist models, ensuring high-quality audio narration.Subtitles
Inclusion of subtitles in the videos, along with real-time highlighting of spoken words, enhances the viewing experience and accessibility.Background Music
Additionally, users can add optional background music, enhancing the video’s auditory appeal, with seamless blending of the narrator’s voice achieved using the Py-Dub library.