Home Community Researchers from MIT and Harvard Introduce Language Models Trained on Media Diets that may Predict Public Opinion

Researchers from MIT and Harvard Introduce Language Models Trained on Media Diets that may Predict Public Opinion

0
Researchers from MIT and Harvard Introduce Language Models Trained on Media Diets that may Predict Public Opinion

Traditional survey-based approaches for measuring public opinion have limitations, but public opinion reflects and influences society’s behavior. Questions on the extent to which AI can understand and adopt human-language-based attitudes have to be explored. Answering these problems has develop into increasingly pressing as huge language models develop and develop into more commonly utilized, because of recent work like GPT3, PaLM, ChatGPT, Claude, and Bard. 

A recent work by MIT and Harvard University follows within the footsteps of other recent advances in natural language processing software that summarize large datasets to assist human decision-making. They present a brand new method for investigating media food regimen models, that are modified language models that mimic the perspectives of subpopulations based on their consumption of certain media (reminiscent of web news, TV broadcasts, or radio shows).

Predictive power, robustness to query framing, effectiveness across media types, and the presence of predictive signals after accounting for demographics are all demonstrated for media food regimen models in public health and economic contexts. Additional analyses show how they’re sensitive to the extent of attention, individuals give to the news and the way their impacts vary depending on the variety of inquiry asked.

🚀 JOIN the fastest ML Subreddit Community

To anticipate how a subpopulation will answer a survey query, the team employs a computational model that inputs an outline of the subpopulation’s media food regimen and the query being asked. In silico public opinion models might be used in the event that they can accurately forecast the outcomes of human surveys. Questions of public sentiment (reminiscent of “How do people feel concerning the pandemic”) and scientific inquiry into media effects (reminiscent of “How does media food regimen affect perceptions of the pandemic”) may very well be aided by such an approach.

There are three stages to developing a model for a media food regimen: 

  1. A language model is developed or used to predict omitted words in a document. On this work, they mostly employ BERT, a pretrained model. 
  2. Modifying the language model by training it on a media food regimen dataset includes content from various media outlets covering a certain timeframe. The researchers use TV and radio to indicate transcripts and web news. This modification lets the model absorb fresh data while concurrently refreshing its internal knowledge representations. 
  3. Asking these models inquiries to see if their response distributions reflect those of populations with different dietary patterns based on the media they devour. They analyze responses to survey questions by querying the media food regimen model. 

The researchers employ regression models during which (i) is used to predict (ii) to undertake public opinion forecasting. The polling information comes from statewide surveys regarding COVID-19 and consumer confidence. Finally, they employ the closest neighbor method to trace the source media food regimen datasets from which the forecasts for a selected survey query were derived.

The importance of media food regimen research is bolstered by three interconnected issues: 

  1. Selective exposure, or the broad systemic bias during which people gravitate towards information that’s coherent with their prior ideas
  2. Echo chambers, where beliefs shared amongst like-minded individuals are amplified and strengthened by the environment chosen
  3. Filter bubbles, where content curation and suggestion algorithms surface items based on users’ past activities, again reinforcing the users’ worldviews.

Models of the media food regimen may very well be used to find out which groups are receiving probably the most potentially hazardous messages. In addition they provide a way for research into the more nuanced effects of communications, reminiscent of the variation in resonance attributable to variations in word selection. While this has been investigated in controlled lab settings and, to a lesser extent, online, researchers specializing in media effects have been hampered by a scarcity of appropriate tools.

The team that these models will eventually be used to assist solve real-world problems with a concentrate on people.


Try the Paper. Don’t forget to hitch our 19k+ ML SubRedditDiscord Channel, and Email Newsletter, where we share the newest AI research news, cool AI projects, and more. If you’ve any questions regarding the above article or if we missed anything, be at liberty to email us at Asif@marktechpost.com

🚀 Check Out 100’s AI Tools in AI Tools Club


Tanushree

” data-medium-file=”https://www.marktechpost.com/wp-content/uploads/2020/10/Tanushree-Picture-225×300.jpeg” data-large-file=”https://www.marktechpost.com/wp-content/uploads/2020/10/Tanushree-Picture-768×1024.jpeg”>

Tanushree Shenwai is a consulting intern at MarktechPost. She is currently pursuing her B.Tech from the Indian Institute of Technology(IIT), Bhubaneswar. She is a Data Science enthusiast and has a keen interest within the scope of application of artificial intelligence in various fields. She is obsessed with exploring the brand new advancements in technologies and their real-life application.


🚀 JOIN the fastest ML Subreddit Community

LEAVE A REPLY

Please enter your comment!
Please enter your name here