Home Community ChatGPT Takes a Walk on the Robotic Side: Boston Dynamics’ Latest Mechanical Marvel Now Talks Back

ChatGPT Takes a Walk on the Robotic Side: Boston Dynamics’ Latest Mechanical Marvel Now Talks Back

ChatGPT Takes a Walk on the Robotic Side: Boston Dynamics’ Latest Mechanical Marvel Now Talks Back

In a groundbreaking development, engineering company Boston Dynamics has integrated ChatGPT, a complicated language model developed by OpenAI, into one in all its remarkable robots, Spot. This canine-like companion is now equipped to supply guided tours around a constructing, providing insightful commentary on each exhibit along the best way.

Spot has undergone a remarkable transformation, now boasting a number of distinctive personalities. Depending on the chosen persona, the robot’s voice, tone, and personalized remarks adapt accordingly. 

To perceive its surroundings, Spot employs Visual Query Answering (VQA) models, able to generating captions for images and providing concise responses to queries about them. This visual data is refreshed roughly once every second and conveyed to the system as a text prompt.

Spot’s communication capabilities have also been enhanced by adding a specially crafted vibration-resistant mount for a Respeaker V2 speaker, a ring-array microphone adorned with LEDs. This modern hardware seamlessly integrates with Spot’s EAP 2 payload via USB.

Control over the robot is managed by an offboard computer, either a desktop PC or a laptop, which communicates with Spot through its Software Development Kit (SDK). A simple Spot SDK service has been implemented to facilitate audio communication with the EAP 2.

Regarding verbal responses, Spot relies on the ElevenLabs text-to-speech service. To optimize response time, engineers have devised a system where text is streamed to the tool in parallel as “phrases”, and the resulting audio is played back serially.

Adding a touch of personality, Spot now exhibits body language capabilities. It could discover and track moving objects, enabling it to discern the situation of the closest person and orient its arm towards them. To create a whimsical touch, a lowpass filter has been applied to the generated speech, mimicking the motion of a puppet’s mouth. This effect is further accentuated by adorning the gripper with comical costumes and affixing googly eyes.

One of the intriguing features of this experiment lies within the AI’s inherent logic, which required minimal fine-tuning. When questioned about its “parents,” Spot astoundingly navigated to the situation where its predecessors resided, humorously declaring them to be its “elders.” It is a testament to the model’s ability to determine statistical associations between concepts without implying consciousness.

Nonetheless, it’s value noting that the demonstration does have its limitations. Spot, like many language models, may occasionally experience hallucinations, where it generates fictitious information. An intriguing example of this phenomenon will be present in an article discussing a Sims-inspired town populated by AI agents. Moreover, there may be a slight delay in responses, with users occasionally experiencing a wait time of roughly six seconds.

Despite these minor setbacks, this project marks a major stride forward in research on the intersection of robotics and AI. Boston Dynamics is committed to further exploring this fusion of technologies, with the final word aim of enhancing robotic performance in human-centric environments. This promising endeavour holds the potential to revolutionize the best way we interact with machines, ushering in a brand new era of intelligent companionship.

Take a look at the Reference Article. All Credit For This Research Goes To the Researchers on This Project. Also, don’t forget to hitch our 32k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, where we share the most recent AI research news, cool AI projects, and more.

When you like our work, you’ll love our newsletter..

We’re also on Telegram and WhatsApp.


” data-medium-file=”https://www.marktechpost.com/wp-content/uploads/2023/01/1674480782181-Niharika-Singh-264×300.jpg” data-large-file=”https://www.marktechpost.com/wp-content/uploads/2023/01/1674480782181-Niharika-Singh-902×1024.jpg”>

Niharika is a Technical consulting intern at Marktechpost. She is a 3rd 12 months undergraduate, currently pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a highly enthusiastic individual with a keen interest in Machine learning, Data science and AI and an avid reader of the most recent developments in these fields.

🔥 Meet Retouch4me: A Family of Artificial Intelligence-Powered Plug-Ins for Photography Retouching


Please enter your comment!
Please enter your name here