Live Multimodal Language Translation System: Integrating Real-Time Text, Voice, Image, and Document Translation

Mrs. Prasanna Pabba; Ch. Yashwanth Sai; Y. Sreeja Manasa; V. Nityadeep; P. Chakridhar

PDF

Published: Sep 16, 2024

Keywords:

LMLTS, OCR, Tesseract, Tkinter, Speech Recognition, Text-To-Speech, Image Processing, Document Translation.

Mrs. Prasanna Pabba

Assistant Professor, Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

Ch. Yashwanth Sai

Students, Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

Y. Sreeja Manasa

Students, Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

V. Nityadeep

Students, Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

P. Chakridhar

Students, Department of Computer Science and Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

Abstract

The purpose of this study is to develop a Live Multimodal Language Translation System (LMLTS) that facilitates real-time translation of text, voice, image, and document inputs, providing outputs in both text and voice formats. Utilizing advanced technologies such as Google Translate API, speech recognition, text-to-speech (TTS), and Optical Character Recognition (OCR), the system aims to break down language barriers and enhance global communication. Methodologically, the system integrates text preprocessing, speech recognition, and OCR for extracting and translating content across various input forms. The implications of this study suggest that LMLTS can serve as a cost-effective alternative to human translators, promoting effective communication and collaboration in a multilingual world.

Issue

Vol. 23 No. 01 (2024)

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Article Sidebar

Main Article Content

Abstract

Article Details