CLIMATE DISCOURSE ON REDDIT - LOW-AI Dashboard

What drives climate action in diets, mobility and homes?

This dashboard enables exploration of climate discourse on Reddit - a social media platform - between 2010 and 2024. It focuses on the attitudes towards climate action in three consumption domains, that is, food, mobility and residential energy. It displays an LLM-supported analysis of the drivers and barriers of adoption of low-carbon behaviors.  ·  For more information on data and methodology please see our methodology webpage or contact us at chowdhary@iiasa.ac.at / eker@iiasa.ac.at.
Data source
|
The direct LLM view shows the original labels on the smaller review sample. The ModernBERT view shows the same questions after those labels have been used to scale the analysis across the much larger Reddit corpus. LLM labels come from the larger direct model and are slower, but more context aware. We use them to create the training data. SLM labels come from ModernBERT after it is trained on those LLM labels. We use this scaling step because the full Reddit dataset is too large to label directly with the larger model every time. The SLM is much faster on the large corpus, while the LLM view lets readers inspect the direct labels themselves.
Keyword frequency in the full Reddit corpus per sector. Word size is proportional to log(count).
Corpus keywords
% of Reddit comments per attitude or barrier for the top 5 survey questions in this domain, ranked by mean prevalence in the later 2010–2024 scaled survey series.
% association
Filtering for lifestyle-specific Reddit discussions
FOOD / DIETS
TRANSPORT / EVs
HOMES / SOLAR PV
We first need a way to isolate the parts of Reddit discourse that are actually about a specific climate-relevant domain. These are search words and regex cues used to filter out irrelevant Reddit noise and isolate sector-specific discussion that is actually about food, transport, or home solar. Word size shows how often those terms appear in the selected corpus.
corpus keyword frequency

Prominence of factors affecting adoption of specific lifestyles on Reddit

AI can help us detect lifestyle choices and the factors influencing those choices in social-media discourse, even when the data were never originally collected for that purpose. With modern language models, we can project large-scale text into subspaces defined by the questions we choose to ask, without need for human labeling. In our case, we track survey questions on Reddit using LLMs.

Here we show the percentage of Reddit comments in which a given attitude or barrier is expressed. These numbers do not measure individual preferences the way surveys do. Instead, they capture an orthogonal dimension: how prominent a factor becomes in collective attention when lifestyle choices are discussed in public.

% of comments showing factor
% of Reddit comments in which a given attitude or barrier is expressed. Top 5 by prominence and confidence.
% yes by year (2010–2024) for the top 5 survey questions per domain under the currently selected label source. Year-over-year trends are the intended use; §3.1 explains the smaller direct LLM checkpoint table separately.
% of comments showing association by year