Audio recording enhacement #1341

nishant-sg · 2024-03-21T18:00:02Z

Before:
The user could only record for a fix 5 seconds

After:
As long as the user is talking everything will be recorded. When the user stops talking it will take a three seconds to stop recording.

Print statements have been removed, for demo purpose please look at the video

demo-record-until-silent-enhancement.mp4

rmackay9 · 2024-03-26T07:07:53Z

MAVProxy/modules/mavproxy_chat/chat_voice_to_text.py

@@ -15,6 +17,15 @@
    print("chat: failed to import pyaudio, wave or openai.  See https://ardupilot.org/mavproxy/docs/modules/chat.html")
    exit()

+def rms( data ):


Hi, thanks for this. A few things to fix here:

could you add comments above the function?

I think we should name the function to be more specific so its purpose is clear. Perhaps calc_audio_volume() and maybe it should return the decibels directly instead of making the caller do that.

our normal style is no spaces after brackets. So change "def rms( data )" to "def rms(data)"

I think it's possible that count could be zero if the microphone is not working. In any case we should have protection against divide-by-zero which could happen if count is zero.

rmackay9 · 2024-03-26T07:09:11Z

MAVProxy/modules/mavproxy_chat/chat_voice_to_text.py

-        while curr_time < time_stop:
+
+        # logic for recording sound until someone is speaking.
+        isSpeaking = True


Hi, I think our normal style is to use underscores between variables so let's change "isSpeaking" to "is_speaking".

Also maybe change the comment to be "record sound while user is speaking"

rmackay9 · 2024-03-26T07:09:45Z

MAVProxy/modules/mavproxy_chat/chat_voice_to_text.py

+            rms1 = rms(data)
+            if rms1!=0.0:
+                decibel = 20 * math.log10(rms1)
+                isSpeaking = decibel>-80.0  # -80 is the hardcoded threshold. higher number means louder. Set threshold in the range (-100,0)


If possible let's move this -80 to be a definition at the top of the file where it will be easier to find.

rmackay9 · 2024-03-26T07:11:33Z

MAVProxy/modules/mavproxy_chat/chat_voice_to_text.py

@@ -34,7 +45,7 @@ def check_connection(self):
            try:
                self.client = OpenAI()
            except Exception:
-                print("chat: failed to connect to OpenAI")
+                print("chat: failed to connect to OpenAI - 4")


I think this change to the print statement can be removed.

nishant-sg added 2 commits March 18, 2024 22:22

enhancement- record voice as long as you are speaking.

bc1bb92

removed print statements

94e4241

peterbarker requested a review from rmackay9 March 22, 2024 04:22

rmackay9 reviewed Mar 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio recording enhacement #1341

Audio recording enhacement #1341

nishant-sg commented Mar 21, 2024

rmackay9 Mar 26, 2024

rmackay9 Mar 26, 2024

rmackay9 Mar 26, 2024

rmackay9 Mar 26, 2024

Audio recording enhacement #1341

Are you sure you want to change the base?

Audio recording enhacement #1341

Conversation

nishant-sg commented Mar 21, 2024

rmackay9 Mar 26, 2024

Choose a reason for hiding this comment

rmackay9 Mar 26, 2024

Choose a reason for hiding this comment

rmackay9 Mar 26, 2024

Choose a reason for hiding this comment

rmackay9 Mar 26, 2024

Choose a reason for hiding this comment