WhisperFrame Depicts The Art Of Conversation

WhisperFrame Depicts The Art Of Conversation

IOTTime Stamp: September 22, 2023 7:00 AM

Source Node: 2893859

Republished By Plato

Followers: 0

Essentially, it uses a Raspberry Pi and a Respeaker four-mic array to listen to conversations in the room. It listens and records 15-20 seconds of audio, and sends that to the OpenWhisper API to generate a transcript.

This repeats until five minutes of audio is collected, then the entire transcript is sent through GPT-4 to extract an image prompt from a single topic in the conversation. Then, that prompt is shipped off to Stable Diffusion to get an image to be displayed on the screen. As you can imagine, the images generated run the gamut from really weird to really awesome.

The natural lulls in conversation presented a bit of a problem in that the transcription was still generating during silences, presumably because of ambient noise. The answer was in voice activity detection software that gives a probability that a voice is present.

Naturally, people were curious about the prompts for the images, so [TheMorehavoc] made a little gallery sign with a MagTag that uses Adafruit.io as the MQTT broker. Build video is up after the break, and you can check out the images here (warning, some are NSFW).

[embedded content]

SEO Powered Content & PR Distribution. Get Amplified Today.
PlatoData.Network Vertical Generative Ai. Empower Yourself. Access Here.
PlatoAiStream. Web3 Intelligence. Knowledge Amplified. Access Here.
PlatoESG. Carbon, CleanTech, Energy, Environment, Solar, Waste Management. Access Here.
PlatoHealth. Biotech and Clinical Trials Intelligence. Access Here.
Source: https://hackaday.com/2023/09/22/whisperframe-depicts-the-art-of-conversation/

Time Stamp: September 22, 2023

More from Hack A Day

This Month’s World’s Largest Wind Turbine Goes Operational

This Month’s World’s Largest Wind Turbine Goes Operational

Source Cluster:

Source Node: 2787159

Time Stamp: Jul 26, 2023

Soldering Holder From Old Lamps

Soldering Holder From Old Lamps

Source Cluster:

Source Node: 3085591

Time Stamp: Jan 27, 2024

No Acid: Open ICs With A Tesla Coil

No Acid: Open ICs With A Tesla Coil

Source Cluster:

Source Node: 2758558

Time Stamp: Jul 12, 2023

2022 FPV Contest: ESP32-Powered FPV Car Uses Javascript For VR Magic

2022 FPV Contest: ESP32-Powered FPV Car Uses Javascript For VR Magic

Source Cluster:

Source Node: 1850048

Time Stamp: Dec 30, 2022

Graphene and Copper Nanowire Thermal Interface with Low Thermal Resistance

Graphene and Copper Nanowire Thermal Interface with Low Thermal Resistance

Source Cluster:

Source Node: 1993725

Time Stamp: Mar 5, 2023

The Deere Disease Spreads To Trains

The Deere Disease Spreads To Trains

Source Cluster:

Source Node: 2996155

Time Stamp: Dec 6, 2023

Classic 1960s Flip Clock Gets NTP Makeover

Classic 1960s Flip Clock Gets NTP Makeover

Source Cluster:

Source Node: 2543107

Time Stamp: Mar 26, 2023

The Tale Of The Final EVGA GPU Overclocking Record

The Tale Of The Final EVGA GPU Overclocking Record

Source Cluster:

Source Node: 1916356

Time Stamp: Jan 23, 2023

RoboGaggia Makes Espresso Coffee On Its Own

Source Cluster:

Source Node: 2593864

Time Stamp: Apr 18, 2023

Stereoscopic Macro Lens Shows Two Is Better Than One

Stereoscopic Macro Lens Shows Two Is Better Than One

Source Cluster:

Source Node: 2993783

Time Stamp: Dec 4, 2023

ThunderScan: The Wild 1980s Product That Turned A Printer Into A Scanner

ThunderScan: The Wild 1980s Product That Turned A Printer Into A Scanner

Source Cluster:

Source Node: 3009411

Time Stamp: Dec 12, 2023

An Epic Quest to Build the Ultimate Game Boy

An Epic Quest to Build the Ultimate Game Boy

Source Cluster:

Source Node: 1867783

Time Stamp: Jan 4, 2023