Tuesday, April 21, 2026

Tried using Gemini to create an audio voice recording on endangered animals: OK but ...

 Just tried Gemini chat to generate an audio track from my prompt, 

Please create a 45-second monologue audio track which is an organgutan appealing to human beings not to do things to put them in danger. The audio track is for upper primary ESL students' listening practice. 

BUT IT COULD ONLY GENERATE A 30-SECOND SONG. 

Then I went to Gemini's 'Canvas' function, and used the same prompt. It worked, but: 

Even though I asked for a audio track, it only gave me the script. After 3 to 4 iterations, I finally got the audio (of 45 seconds).

But after I clicked 'play', I had to wait for almost a minute for the audio to load.

I asked for a Download function. It gave it to me on the next iteration. I could download the audio as a wav file. 

I got the Share Link, and tried it in a new tab. (I had to sign in to Poe.) Again, after clicking 'play', it took a minute for the audio to load. 

The Share Link: https://gemini.google.com/share/584aa17992db

I tried the Share link in an incognito browser, it didn't work. 


No comments:

Created a poem from a picture using Gemini: It did a super job

  昨天黃昏post了東涌日落我拍的照片,跟著Francis Tsang 提議我請AI 為之寫詩;這主意從未想過,於是立指打開Gemini, 上傳那照片,寫了一個簡單的prompt, 請它寫一首詩。 不用10秒,詩出現眼前,而且真的配合照片內容,連照片內細小的cable car ...