Combining Audio Chunks Received via WebSockets into a Single Audio File in Django

BJoshi · July 19, 2024, 10:06am

I’m working on a Django project where I need to collect streaming audio chunks from the frontend via WebSockets and then combine them on the backend into a single audio file. My current setup streams audio on the frontend and sends it to the backend in small chunks. After creating the audio file on the backend, I want to transcribe it to text.

KenWhitesell · July 19, 2024, 10:51am

Welcome @BJoshi !

That sounds like an interesting project.

BJoshi · July 19, 2024, 11:03am

Thank you for the reply, but I need a solution for this. I have already searched and done R&D on it, but I still haven’t found one.

KenWhitesell · July 19, 2024, 11:06am

So what part of this do you have questions about? The more specific you are with identifying where you are stuck, the better chance we have of providing assistance.

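BJoshi · July 19, 2024, 12:56pm

import io
import json

import speech_recognition as sr
import whisper
from pydub import AudioSegment
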
    async def connect(self):
        await self.accept()
        self.model = whisper.load_model('base')
        # self.audio_chunk = []

    async def disconnect(self, close_code):
        await self.close()

    async def receive(self, text_data=None, bytes_data=None):
        if bytes_data:
            try:
                file_name = 'test.wav'
                audio = AudioSegment.from_file(io.BytesIO(bytes_data), format='webm')
                audio.export(file_name, format="wav")

                text = await speech_to_text(file_name)
                print(text,'text')
            except Exception as e:
                await self.send(text_data=json.dumps({
                    'error': f"Error: {str(e)}"
                }))

async def speech_to_text(file_name):

    recognizer = sr.Recognizer()
    
    with sr.AudioFile(file_name) as source:
        audio_data = recognizer.record(source)
    try:
        text = recognizer.recognize_google(audio_data)
        print(text,'text')
        return text
    except sr.UnknownValueError:
        return "Could not understand the audio"
    except sr.RequestError as e:
        return f"Could not request results from the speech recognition service; {e}"

Here is my consumers.py file

BJoshi · July 19, 2024, 12:57pm

                    const stream = await navigator.mediaDevices.getUserMedia({audio:true});

                    if (!MediaRecorder.isTypeSupported('audio/webm')) {
                        alert('Browser not supported for MediaRecorder');
                        return;
                    }

                    const mediaRecorder = new MediaRecorder(stream, {
                        mimeType: 'audio/webm',
                    });

                    const socket = new WebSocket('ws://' + window.location.host + '/ws/chat/');

                    socket.onopen = () => {
                        document.querySelector('#status').textContent = 'Connected';
                        console.log({ event: 'onopen' });

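                        mediaRecorder.addEventListener('dataavailable', async (event) => {
                            if (event.data.size > 0 && socket.readyState === WebSocket.OPEN) {
                                // Convert the Blob to an ArrayBuffer so the backend receives it as bytes_data
                                const array = await event.data.arrayBuffer();
                                socket.send(array);
                            }
                        });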
                        mediaRecorder.start(1000);
                    };

Here is my JavaScript code, in which I capture audio and send it via WebSocket

KenWhitesell · July 19, 2024, 12:58pm

Please do not post images of code here. Copy/paste the code into the body of your post.

When posting code here, enclose the code between lines of three backtick - ` characters. This means you’ll have a line of ```, then your code, then another line of ```. This forces the forum software to keep your code properly formatted.

Please delete these images and post the code you would like us to examine.

Also, don’t just post the code. Be specific and identify what the issue or problem is that you would like assistance with.

BJoshi · July 19, 2024, 12:59pm

OK, thank you for the information.

BJoshi · July 19, 2024, 1:07pm

In this code I want to create an audio file from the bytes_data I send through JavaScript and transcribe this audio file to text.
In short, I want to transcribe real-time audio into text. If you know of any other option besides my code, please share it.
I don’t want to use any paid services.

KenWhitesell · July 19, 2024, 1:25pm

Sorry, I am not familiar with any such software.

BJoshi · July 19, 2024, 1:36pm

OK, have you seen my code?

Does it need improvement?

kenilJoshi · November 9, 2024, 8:16am

Use ffmpeg.
It works well, and I was also facing the same issue.
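
A minimal sketch of how that could look, assuming the browser sends `MediaRecorder` chunks with a timeslice as in the JavaScript above. With a timeslice, the WebM container header is normally present only in the first chunk, so later chunks cannot be decoded on their own. The chunks need to be joined in arrival order first, and the joined file can then be handed to ffmpeg. `ChunkBuffer`, `ffmpeg_to_wav_command`, and `convert_to_wav` are hypothetical names used for illustration, and the 16 kHz mono output is an assumption, not something from the posts above. The sketch also assumes the `ffmpeg` binary is on the PATH.

```python
import subprocess
from pathlib import Path


class ChunkBuffer:
    """Collects binary WebSocket frames in arrival order (hypothetical helper)."""

    def __init__(self) -> None:
        self._parts: list[bytes] = []

    def add(self, chunk: bytes) -> None:
        # Ignore empty frames; keep everything else in the order received.
        if chunk:
            self._parts.append(chunk)

    def to_bytes(self) -> bytes:
        # Plain concatenation: the first chunk carries the WebM header,
        # the following chunks continue the same stream.
        return b"".join(self._parts)

    def save(self, path) -> Path:
        path = Path(path)
        path.write_bytes(self.to_bytes())
        return path


def ffmpeg_to_wav_command(src, dst, sample_rate=16000) -> list[str]:
    # -y: overwrite output, -i: input file, -ac 1: mono, -ar: sample rate.
    # The 16 kHz mono target is an assumption for speech recognition input.
    return ["ffmpeg", "-y", "-i", str(src), "-ac", "1", "-ar", str(sample_rate), str(dst)]


def convert_to_wav(src, dst) -> None:
    # Raises CalledProcessError if ffmpeg exits with a non-zero status.
    subprocess.run(ffmpeg_to_wav_command(src, dst), check=True, capture_output=True)
```

In the consumer, one such buffer could be created in `connect`, fed with `bytes_data` in `receive`, and saved and converted once the recording ends, for example in `disconnect`. Since `subprocess.run` blocks, calling `convert_to_wav` through `asyncio.to_thread` keeps the event loop free.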
