live_api_starter.py under gemini-2 directory, keeps interrupting itself without getting interrupted #356

mingqxu7 · 2024-12-16T19:32:18Z

Description of the bug:

(gemini) mingqxu: gemini-2 % python live_api_starter.py
message > 2024-12-16 11:27:30.635 python[95931:17402150] WARNING: AVCaptureDeviceTypeExternal is deprecated for Continuity Cameras. Please use AVCaptureDeviceTypeContinuityCamera and add NSCameraUseContinuityCameraDeviceType to your Info.plist.
Turn complete
Turn complete
Turn complete
Turn complete
Turn complete
Turn complete
Turn complete
Turn complete
Turn complete

Its responses break off partway through while we are conversing, as if it had been interrupted.

Actual vs expected behavior:

Expected: it completes its utterance without stopping. Actual: it stops partway through, as if interrupted.

Any other information you'd like to share?

I am running it on an M1 Mac using Miniforge Python. The Python version is 3.12.

Giom-V · 2024-12-17T08:04:18Z

We just updated the script. Could you try again and tell me whether it still happens?

timmy59100 · 2024-12-17T17:14:10Z

Same issue for me with the latest version of the script.

mingqxu7 · 2024-12-18T08:13:24Z

Still having the same issue with the latest version of the script.

Giom-V · 2024-12-18T10:30:46Z

Are you using headphones or speakers? One issue we've realized is that most browsers have built-in echo cancellation, which is why it works with a speaker on the AI Studio website. When you run the script on your own, you don't have that by default, so the microphone can pick up the model's own audio from the speakers. Depending on your OS, you should check the best way to enable it (for example, https://docs.pipewire.org/page_module_echo_cancel.html for Linux).
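If neither headphones nor OS-level echo cancellation is an option, one possible workaround is to stop forwarding microphone audio while the model's reply is playing (half-duplex gating). The sketch below is only an illustration under assumptions: `PlaybackGate` and `filter_mic_chunks` are hypothetical names, not part of live_api_starter.py or any Gemini SDK, and it assumes the client has a point where it hands audio chunks to the speaker and a separate point where it reads microphone chunks.

```python
import time


class PlaybackGate:
    """Tracks whether the speaker is, or very recently was, playing audio."""

    def __init__(self, hold_seconds=0.3, clock=time.monotonic):
        # Extra quiet window after playback ends, to let room echo die down.
        self._hold_seconds = hold_seconds
        self._clock = clock
        # Estimated time at which queued playback finishes; nothing queued yet.
        self._playback_end = float("-inf")

    def note_playback(self, chunk_seconds):
        """Call when a chunk lasting chunk_seconds is handed to the speaker."""
        now = self._clock()
        # Chunks queue up back to back, so extend from the later of the two.
        self._playback_end = max(now, self._playback_end) + chunk_seconds

    def mic_allowed(self):
        """True once playback plus the hold window has elapsed."""
        return self._clock() >= self._playback_end + self._hold_seconds


def filter_mic_chunks(chunks, gate):
    """Yield only the mic chunks read while the gate is open."""
    for chunk in chunks:
        if gate.mic_allowed():
            yield chunk
```

In a client like this one, `note_playback` would be called wherever model audio is written to the output stream, and microphone chunks would be dropped while `mic_allowed()` is false. The trade-off is that you can no longer interrupt the model by speaking over it, so proper echo cancellation or headphones remains the better fix.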

mingqxu7 · 2024-12-19T08:22:47Z

Yes, using headphones works.

sl-knowledge · 2024-12-20T17:33:34Z

I adapted the example code live_api_starter.py so that it runs in the Chrome/Edge browser on an iMac with its external mic and speakers. However, it still produces the echo effect, while the AI Studio website running on the same machine and browser is fine. So I think the browser's noise cancellation is probably not what matters here.

Also, for the audio stream, most of the time the AI replied with only a few sentences and then stopped to ask the user questions, which is quite annoying. That makes it impractical for daily use. Even for testing purposes, it made me lose interest in testing it further.

Giom-V · 2024-12-21T22:15:09Z

But have you updated the code to use the built-in echo cancellation from the browser?
Gemini tells me to try this:

navigator.mediaDevices.getUserMedia({
    audio: {
        echoCancellation: true // Explicitly request echo cancellation
    },
    video: true
})

sl-knowledge · 2024-12-22T20:22:11Z

Yes. My code already sets echoCancellation to true. Echo is still an issue, so I now need to use a Bluetooth speaker placed away from the iMac to communicate with the AI. BTW, Gemini can now reply with more words before asking questions, and I find it pleasant to talk to.

const stream = await navigator.mediaDevices.getUserMedia({
    audio: {
        channelCount: 1,
        sampleRate: 16000,
        echoCancellation: true,
        noiseSuppression: true,
        autoGainControl: true
    }
});
