Development and Evaluation of an AI-powered Video Captioning and Transcription Web App
- Topic:
- Development and Evaluation of an AI-powered Video Captioning and Transcription Web App
- Type:
- Bachelor's thesis (BA)
- Supervisor:
- Thomas Schmidt
- Student:
- Thi Ha My Pham
- First examiner:
- Christian Wolff
- Status:
- in progress
- Keywords:
- web development, video, transcription, aws, user interface design, subtitles, content creation, social media
- Created:
- 2024-01-08
- Kick-off presentation:
- 2024-05-06
Background
- Increasing importance of video content online: With platforms like YouTube, TikTok, other social media, and online courses, video has become a predominant medium for communication, entertainment, and education. This shift highlights the need for solutions that improve the accessibility and usability of video content.
- The need for accessible and searchable video content: As the volume of video content continues to surge, ensuring accessibility and searchability has become paramount. Users expect not only seamless access to videos but also the ability to search and navigate through the content efficiently. This poses a challenge, particularly for individuals with hearing impairments who may rely on captions for a comprehensive understanding.
- Challenges associated with manual video captioning and transcription: Human transcription is not only time-consuming but also prone to errors. The intricate nature of spoken language, accents, and diverse content makes accurate manual transcription a daunting task. This inefficiency underscores the necessity for automated tools to streamline and enhance the video content processing workflow.
- Cost of manual transcription: Content creators who transcribe by hand invest significant time and resources and often face delays and increased costs.
- Accessibility issues for individuals with hearing impairments: Videos without captions remain largely inaccessible to viewers who rely on them.
Objectives of the thesis
Develop an AI-powered Video Captioning and Transcription Web App to automate the process.
- Improve accessibility to video content for users with hearing impairments.
- Increase the efficiency and accuracy of video transcription.
- Create a user-friendly interface for easy navigation and interaction.
- Evaluate the effectiveness of the AI model in real-world scenarios.
Specific tasks
Front-end development
- Create an intuitive and responsive user interface using HTML, CSS, and JavaScript, with a focus on accessibility and usability.
- Ensure a smooth and responsive experience across various devices.
- Integrate video playback functionality.
- Create a form for uploading video files.
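Caption display during video playback in the browser is commonly driven by a WebVTT file attached to the video element as a text track. The following is a minimal sketch of how caption segments could be serialized to WebVTT, not a prescribed part of the thesis. The `Segment` tuple shape and the function names are assumptions made for illustration.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape: (start in seconds, end in seconds, caption text).
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{millis:03d}"


def to_webvtt(segments: Iterable[Segment]) -> str:
    """Serialize caption segments into the text of a WebVTT file."""
    # A WebVTT file starts with the "WEBVTT" header followed by a blank line.
    lines = ["WEBVTT", ""]
    for start, end, text in segments:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text.strip())
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_webvtt([(0.0, 2.5, "Hello and welcome."), (2.5, 5.0, "Let's get started.")]))
```

The resulting text can be served as a `.vtt` file and referenced from a `<track>` element inside the HTML `<video>` element.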
AWS integration
- Set up Amazon Transcribe and integrate it into the web app.
- Establish secure communication channels with AWS services.
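As a rough illustration of the integration step, the sketch below builds the parameters for a batch transcription job on an uploaded video. It assumes the video has already been stored in an S3 bucket and that the job is started server-side with the boto3 SDK. The bucket name, object key, and the helper name `build_transcription_request` are hypothetical; the parameter names follow the `StartTranscriptionJob` operation.

```python
import re
from typing import Any, Dict


def build_transcription_request(bucket: str, key: str, language_code: str = "en-US") -> Dict[str, Any]:
    """Build keyword arguments for a batch transcription job on an S3 object."""
    # Job names may only contain letters, digits, '.', '_' and '-',
    # so every other character in the object key is replaced with '-'.
    job_name = re.sub(r"[^0-9A-Za-z._-]", "-", key)
    # Assumption: the file extension matches the actual media format (e.g. mp4, webm).
    media_format = key.rsplit(".", 1)[-1].lower()
    return {
        "TranscriptionJobName": job_name,
        "LanguageCode": language_code,
        "MediaFormat": media_format,
        "Media": {"MediaFileUri": f"s3://{bucket}/{key}"},
        # Ask the service to also produce a WebVTT subtitle file.
        "Subtitles": {"Formats": ["vtt"]},
    }


if __name__ == "__main__":
    request = build_transcription_request("example-bucket", "uploads/demo.mp4")
    print(request)
    # With AWS credentials configured on the server, the job could be started with:
    #   import boto3
    #   boto3.client("transcribe").start_transcription_job(**request)
```

Keeping this call on the server rather than in browser code is one way to avoid exposing AWS credentials to end users.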
Real-time Captioning
- Implement real-time captioning features using Amazon Transcribe.
Functional and user testing
- Ensure all features work as intended.
- Validate the accuracy of the transcription and captioning.
- Gather feedback on the user interface and overall experience.
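One common way to quantify transcription accuracy is the word error rate (WER): the word-level edit distance between a human reference transcript and the automatic transcript, divided by the number of reference words. The topic description does not name a specific metric, so the following is only a sketch of one possible measure. The function name and the simple lowercase and whitespace normalization are assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing in hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    hypothesis = "the cat sat on mat"
    print(f"WER: {word_error_rate(reference, hypothesis):.3f}")
```

A lower value indicates a more accurate transcript. Punctuation handling and number formatting would need to be decided before comparing real transcripts.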
Expected prior knowledge
- Web development in HTML, CSS and JavaScript
- Familiarity with integrating AWS services into web applications
- Usability Testing
Further reading
- Mahoney, K. (2023, June 28). The Current State of Captioning: A Report by 3Play Media. 3Play Media. https://www.3playmedia.com/blog/the-current-state-of-captioning-a-report-by-3play-media/
- Sheth, A. (2023, November 29). Speech Recognition: AWS Transcription Platform Embraces Generative AI. Prompts Daily. https://www.neatprompts.com/p/revolutionizing-speech-recognition-aws-transcription-platform-embraces-generative-ai
- Guida, L. (2022, June 10). Use AWS AI and ML services to foster accessibility and inclusion of people with a visual or communication impairment. AWS Machine Learning Blog. https://aws.amazon.com/blogs/machine-learning/use-aws-ai-and-ml-services-to-foster-accessibility-and-inclusion-of-people-with-a-visual-or-communication-impairment/
- Rajamani, S., & Penmatcha, R. (2021, March 10). Translate video captions and subtitles using Amazon Translate. AWS Machine Learning Blog. https://aws.amazon.com/blogs/machine-learning/translate-video-captions-and-subtitles-using-amazon-translate/
- Guttikonda, S., & Saxman, P. (2023, October 16). Generative AI in education: Building AI solutions using course lecture content. AWS Public Sector Blog. https://aws.amazon.com/blogs/publicsector/generative-ai-education-building-ai-solutions-using-course-lecture-content/