Climate mitigation requires more than technological solutions—it also depends on behaviour change shaped by individual, social, and structural factors^[1,2,12]. Collective behaviours and social organisation are part of everyday life, and feeling part of active collective action can render mitigation measures more efficient and pervasive^[13]. Social and cultural processes play an important role in shaping what actions people take on climate mitigation, interacting with individual, structural, institutional, and economic drivers^[14]. Just like infrastructure, social and cultural processes can lock societies into carbon-intensive patterns of service delivery. They also offer potential levers to change normative ideas and social practices in order to achieve extensive emissions cuts^[13,14]. Here, behaviour is treated as a discrete action in a specific consumption domain, such as shifting to a plant-based diet or using an electric vehicle instead of an internal combustion engine vehicle^[12]. Individual drivers include cognitive and psychological factors such as climate risk perception or self-efficacy. Factors such as comfort, status, identity, and agency are associated with many technologies and everyday social practices that deliver energy services, from driving a car to eating vegan food^[15,16]. Social drivers include the social costs of adopting or not adopting a behaviour, especially social norms and their influence. Action on climate mitigation is influenced by our perception of what other people commonly do, think, or expect, known as social norms^[2]. Second-order beliefs—perceptions of what others in the community believe—are particularly important for leveraging descriptive norms^[6]. Structural drivers include the surrounding socioeconomic and technical context, such as income, technology cost, availability, and convenience. Symbolic motives are more important predictors of technology adoption than instrumental motives^[17,18].

Together, Food/Diets, Transport/EVs, and Homes/solar PV account for a large share of household carbon footprints and a large share of the public choices that shape climate mitigation. The current dashboard focuses on these three domains because they correspond to three concrete low-carbon behaviours: plant-based diets in food, electric-vehicle adoption in mobility, and rooftop solar PV adoption in homes. They also differ in how people make sense of them: transport choices are strongly entangled with symbolic car use and status^[15,16], food choices are tied to health, climate, and identity-linked reasoning^[12], and household solar adoption depends not only on costs and convenience but also on trust in providers and institutions^[18,19,20]. Trust in organisations is a key predictor of the take-up of novel energy services, particularly when financial incentives are high^[18,19,20].

Conventional ways of measuring readiness for these behaviour changes (surveys, experiments, polls, and related instruments) are useful but costly, slow, and limited in temporal resolution^[3,5,7,9]. Social media can complement these approaches. Reddit is especially useful here because it contains large volumes of open-ended, relatively nuanced discussion about everyday choices, reasons, barriers, and public reactions. This dashboard therefore uses Reddit as a large-scale naturalistic record of climate-relevant lifestyle discussion, rather than relying on surveys alone.

The objective is to explore the drivers and barriers of low-carbon behaviours using open-ended Reddit posts. For individual and structural drivers, the dashboard compares Reddit discourse with existing survey questions and measures how often those survey ideas appear in posts. For social drivers, it examines social influence through descriptive norms, injunctive norms, and reference groups. Across both parts of the workflow, large language models are used and their reliability is checked through repeated labeling, cross-model checks, and manual review. The result is both a substantive picture of climate discourse on Reddit and a methodological demonstration of how large language models can be used to complement survey-based measurement.

1.1 Knowledge Gap

IPCC AR6 WGIII highlights a knowledge gap in understanding the dynamic interaction between individual, social, and structural drivers of change, and in particular asks how social media influences the development and impacts of narratives about low-carbon transitions^[1]. More research is needed to assess the role played by social media platforms in influencing emerging narratives of climate change and low-carbon transition^[21]. Traditional measures of societal readiness (public opinion polls, climate opinion maps, and related survey-style instruments) can be slow, costly, or limited in temporal granularity^[8,9,12]. Existing social-media studies also tend to compress discourse into broad sentiment categories^[10], ignoring the richer norm taxonomy that behavioural science has shown to matter (descriptive vs. injunctive, reference group specificity, and the difference between strict norm statements and public normative performances in Reddit-style discourse). To our knowledge, no prior study has tracked survey questions directly in social-media data at scale. This dashboard projects Reddit discourse into subspaces defined by established survey instruments, revealing how prominent each factor becomes in collective attention when lifestyle choices are discussed publicly. This captures a dimension orthogonal to the individual preferences measured by surveys.

1.2 Contributions

This dashboard uses Reddit discourse to extend and complement past survey-based work on climate-related behaviour. It asks whether the very same questions from carefully crafted survey instruments can be used with AI, at scale, to extract meaningful insights from observed public discourse rather than only from self-reported responses. In that sense, the dashboard does not simply reproduce survey categories on a new platform; it tracks an orthogonal dimension of collective attention, namely which motivators, barriers, and social pressures are publicly evoked when people discuss food, transport, and home-energy choices online. At a broader level, narratives about climate mitigation circulate within and across societies, and enable people to imagine and make sense of the future through processes of interpretation, understanding, communication, and social interaction^[17]. The analysis therefore uses AI to recover structured signals from messy social-media text while also tracing social norms—approval, disapproval, observed behaviour, and reference groups—in public conversation.

2. Data

2.1 Data Collection and Keyword Filtering

SCHEMATIC SHOWING OUR APPROACH

[Schematic: pipeline from the raw Reddit corpus to scaled labels]

Raw Reddit corpus: 20.1M posts from subreddits including r/vegan, r/veganarchism, r/vegancirclejerk, r/electricvehicles, r/ElectricScooters, r/Electricmotorcycles, r/solar, r/climate, r/ClimateActionPlan, r/climatechange, ...

Regex filter: sector-relevant posts (Housing, Transport, Food), 3.7M total matched.

Branch 1, Survey questions (Lifestyle Adoption Factors). Sample: 29 survey questions, 1,500 comments per question, oversampled up to 5,000 where the answer categories were highly imbalanced. Prompt format: Yes/No classification, e.g. “Links reducing meat to health?”

Branch 2, Social norms (Social Influence). Sample: 24,000 unique comments from the survey-aligned sample. What is measured: norm presence, descriptive norm, injunctive norm, reference group.

Labelling: an LLM (9-billion parameter model) labels the sample, creating high-quality training data on lifestyle survey questions and social influence detection.

Scaling: ModernBERT-base (~293M parameter model) is trained on the labeled data to scale labeling efficiently.

As shown in the schematic, we begin with the matched Reddit corpus and then split the analysis into two linked branches: survey questions and social norms. The first branch uses a large language model to extract insights related to established survey questions from Reddit discourse, extending past survey-based work on climate-related behaviour to a naturalistic record of public discussion. The second branch detects the presence of social norms in the same data. This dual approach addresses a gap highlighted by IPCC AR6 WGIII, which calls for better understanding of how social media influences narratives about low-carbon transitions and how individual, social, and structural drivers of change interact dynamically.
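
The regex filtering step that produces the sector-relevant corpus can be sketched roughly as follows. This is a minimal illustration only: the keyword patterns and the function name `match_sectors` are assumptions made for the example, since the dashboard's actual keyword lists are not given in this text.

```python
import re

# Hypothetical keyword patterns per sector (assumed for illustration;
# the dashboard's real keyword lists are not shown in the text).
SECTOR_PATTERNS = {
    "food": re.compile(r"\b(vegan|plant[- ]based|meat(less)?)\b", re.IGNORECASE),
    "transport": re.compile(r"\b(evs?|electric (car|vehicle)s?)\b", re.IGNORECASE),
    "housing": re.compile(r"\b(solar panels?|rooftop solar|solar pv)\b", re.IGNORECASE),
}


def match_sectors(text: str) -> list[str]:
    """Return every sector whose keyword pattern occurs in the post text."""
    return [sector for sector, pattern in SECTOR_PATTERNS.items() if pattern.search(text)]


posts = [
    "Went vegan last year and my grocery bill dropped.",
    "Is rooftop solar worth it if I also charge an EV at home?",
    "The weather was nice today.",
]

# Keep only posts that match at least one sector.
matched = {post: match_sectors(post) for post in posts if match_sectors(post)}
```

A post may match more than one sector, so the sketch returns a list rather than a single label; whether the dashboard assigns posts to one sector or several is not stated here.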

3.1 Classifying survey questions in the matched Reddit corpus

The survey branch begins by defining which published survey questions are being tracked and how the training sample is constructed. Table 1 documents the survey-frame questions and their source trace. The sampling block below reports the direct LLM checkpoint and the larger ModernBERT inference sample used in the current dashboard version.

We then test whether the survey-style labels are stable enough to use as training data. To do this, we run the LLM twice on the same comments with the same prompt and compare the two outputs. Table 2 shows how this robustness check, together with the observed YES-rate, separates rare questions, unreliable questions, and questions that are kept for scaling. Table 3 then shows what happens in the next step, where those LLM labels are used to train ModernBERT and to decide which questions are strong enough to foreground in the Lifestyle Adoption Factors plots.
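
One way to operationalise this two-run robustness check is sketched below. The text only says that the two outputs are compared, so the use of Cohen's kappa as the agreement statistic, the threshold values, and the function names `cohen_kappa` and `triage` are all assumptions for illustration, not the dashboard's documented procedure.

```python
def cohen_kappa(run_a: list[str], run_b: list[str]) -> float:
    """Cohen's kappa between two label sequences over the same comments."""
    if len(run_a) != len(run_b) or not run_a:
        raise ValueError("runs must be non-empty and of equal length")
    n = len(run_a)
    observed = sum(a == b for a, b in zip(run_a, run_b)) / n
    labels = set(run_a) | set(run_b)
    # Chance agreement from the marginal label frequencies of each run.
    expected = sum((run_a.count(lab) / n) * (run_b.count(lab) / n) for lab in labels)
    if expected == 1.0:  # both runs used one identical label throughout
        return 1.0
    return (observed - expected) / (1.0 - expected)


def triage(run_a: list[str], run_b: list[str],
           min_yes_rate: float = 0.02, min_kappa: float = 0.6) -> str:
    """Sort a question into 'rare', 'unreliable', or 'keep' (thresholds are assumed)."""
    yes_rate = (run_a.count("YES") + run_b.count("YES")) / (2 * len(run_a))
    if yes_rate < min_yes_rate:
        return "rare"
    if cohen_kappa(run_a, run_b) < min_kappa:
        return "unreliable"
    return "keep"


# Toy labels for one question, same comments and same prompt, two LLM runs.
run_a = ["YES", "NO", "NO", "YES", "NO", "NO", "NO", "YES", "NO", "NO"]
run_b = ["YES", "NO", "NO", "YES", "NO", "YES", "NO", "YES", "NO", "NO"]
kappa = cohen_kappa(run_a, run_b)
decision = triage(run_a, run_b)
```

Checking the YES-rate before the agreement statistic matters in this sketch: when almost every label is NO, raw agreement is high by construction, so rarity is flagged first.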

3.2 Evaluating the ModernBERT social-norm model on held-out data

We then ask whether the same comment pool can support stable social-norm labels for norm presence, descriptive norms, injunctive norms, and reference groups. Here too, the LLM first creates the training labels and ModernBERT is trained on those labels in the next step. Table 4 shows the held-out model quality for these four social-norm targets and indicates which parts of the Social Influence tab are currently on stronger footing than others.

The current dashboard slice uses four active social-norm questions per comment. The schema table below makes those targets explicit before the model-quality results are shown.

Social Norms (current dashboard slice: 4 active social-norm questions per comment, all sectors):

ID	Question	Options
`1.1_gate`	Does the comment reference a social norm?	yes / no
`1.2.1_descriptive`	Descriptive norm present?	present / absent / unclear
`1.2.2_injunctive`	Injunctive norm present?	present / absent / unclear
`1.3.1_reference_group`	Which social group is referenced?	coworkers / family / friends / general public / identity group / local community / neighbors / online community / other / other reddit users / partner/spouse

The F1 scores across all four social-norm targets are generally strong, indicating that ModernBERT learns the LLM-derived labels well. However, descriptive norms appear harder for the small model to predict than injunctive norms, where performance is noticeably better.
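
For readers unfamiliar with the metric, the sketch below shows how a per-label and macro-averaged F1 score could be computed on held-out data, treating the LLM-derived labels as the reference and the ModernBERT predictions as the output under evaluation. The example labels are made up, and whether the dashboard reports macro, micro, or weighted F1 is not stated in the text, so the macro average here is an assumption.

```python
def f1_for_label(reference: list[str], predicted: list[str], positive: str) -> float:
    """F1 for one label, treating that label as the positive class."""
    pairs = list(zip(reference, predicted))
    tp = sum(r == positive and p == positive for r, p in pairs)
    fp = sum(r != positive and p == positive for r, p in pairs)
    fn = sum(r == positive and p != positive for r, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(reference: list[str], predicted: list[str]) -> float:
    """Unweighted mean of per-label F1 over the labels seen in the reference."""
    labels = sorted(set(reference))
    return sum(f1_for_label(reference, predicted, lab) for lab in labels) / len(labels)


# Made-up held-out labels for a descriptive-norm target (illustrative only).
llm_labels = ["present", "absent", "absent", "present", "unclear", "absent"]
bert_labels = ["present", "absent", "present", "present", "absent", "absent"]

f1_present = f1_for_label(llm_labels, bert_labels, "present")
f1_macro = macro_f1(llm_labels, bert_labels)
```

Because the reference labels come from the LLM rather than from human annotators, a high score in this setup indicates that the small model reproduces the LLM's labels, not that the labels themselves are correct.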

3.3 Computing confidence in the 9B-generated labels using a stronger 27B model

We ask whether the direct LLM labels themselves are credible enough to support the later training and scaling steps. For this purpose, we run a stronger model, Qwen 3.6 27B, on a sampled review set drawn from the survey and social-norm labels. Table 5 reports that external confidence check. These are the same confidence values that are shown in the dashboard plots, so the visible plot annotations reflect stronger-model agreement with the original 9B labels rather than small-model scalability. This step does not replace the current pipeline, but it indicates where the existing labels are reliable and where caution is warranted. In principle, one could continue to select ever-larger models as judges, but this escalation has no natural stopping point. Instead, our aim is to demonstrate a practical approach in which stronger, more computationally expensive models serve as reference judges for smaller, faster, and cheaper models. This establishes a scalable hierarchical system that, in general, can extract climate insights from large-scale social media data without requiring human labelling.

The 27B model's confidence check confirms the pattern seen in section 3.2: descriptive norms are harder than injunctive norms to identify reliably, even for the larger 9B model. This may be because descriptive norms are defined somewhat more vaguely; observing what others do is a subtler signal than explicit approval or disapproval. Overall, social-norm labels show lower confidence than survey-question labels.
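
A confidence value of this kind can be read as the share of sampled comments on which the stronger judge agrees with the original label. The sketch below computes that agreement rate and adds a Wilson score interval to show how much the estimate depends on the size of the review set. The interval is an addition for illustration and is not described in the text; the function names and the toy labels are likewise assumptions.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 for roughly 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


def judge_agreement(labels_small: list[str], labels_judge: list[str]):
    """Agreement rate between original labels and judge labels, with an interval."""
    if len(labels_small) != len(labels_judge):
        raise ValueError("label lists must have equal length")
    n = len(labels_small)
    agree = sum(a == b for a, b in zip(labels_small, labels_judge))
    low, high = wilson_interval(agree, n)
    return agree / n, low, high


# Toy review set: the judge disagrees with the original label on 2 of 20 comments.
labels_9b = ["YES"] * 8 + ["NO"] * 12
labels_27b = ["YES"] * 7 + ["NO"] * 12 + ["YES"]
rate, low, high = judge_agreement(labels_9b, labels_27b)
```

With only 20 reviewed comments the interval in this toy case is wide, which is one reason the size of the sampled review set matters when interpreting the plot annotations.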

3.4 Comparing Surveys to Reddit: Prevalence of Key Frames in U.S. Public Opinion

Language models let us project large-scale social-media text into subspaces defined by survey-style questions, even though those data were never collected for that purpose. What emerges is not a measure of individual preferences. Rather, it reflects how visible a given factor becomes in public conversation when climate action is on the table, an orthogonal dimension to what structured surveys capture.

Reddit % is the share of comments in which an LLM classifier flagged the frame as present (YES/NO, direction-agnostic). Survey % combines "strongly" + "somewhat" agree (Pew) or "very" + "somewhat" willing/positive (Gallup; Yale). The two measures capture related but distinct constructs and are not directly comparable.
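
The two percentages defined above can be computed as in the following sketch. All numbers are invented for the example and are not results from the dashboard or from any survey; the function names and response-category keys are also assumptions.

```python
def reddit_share(flags: list[bool]) -> float:
    """Percentage of comments in which the classifier flagged the frame as present."""
    if not flags:
        raise ValueError("flags must be non-empty")
    return 100.0 * sum(flags) / len(flags)


def survey_share(response_pct: dict[str, float], positive: tuple[str, ...]) -> float:
    """Sum the percentages of the response categories counted as positive."""
    return sum(response_pct[category] for category in positive)


# Invented classifier output for five comments (True = frame present).
flags = [True, False, False, True, False]

# Invented survey distribution for one question, in percent.
responses = {
    "strongly agree": 30.0,
    "somewhat agree": 25.0,
    "somewhat disagree": 25.0,
    "strongly disagree": 20.0,
}

reddit_pct = reddit_share(flags)
survey_pct = survey_share(responses, ("strongly agree", "somewhat agree"))
```

The denominators differ in kind: the Reddit figure is a share of comments, while the survey figure is a share of respondents, which is why the two columns should be read side by side rather than subtracted.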

The survey data show that respondents broadly agree all these factors influence their choices, with comparable agreement rates across questions. Reddit discourse tells a different story: it naturally differentiates between more commonly discussed frames, such as animal welfare in food choice, and those that receive far less public attention.

3.5 Limitations and Future Directions

As the reader may have noticed, the LLM tracks easier versions of the original survey questions in some instances. This is deliberate, as we found in our experiments that the complexity of a question can stump an LLM of a certain size; a larger model may resolve this at the cost of more compute and parameters.

A second limitation is separate from question complexity. The Reddit data are not self-reported descriptions of individuals; the analysis is better understood as projecting large-scale text into subspaces defined by the questions we choose to ask. There is no guarantee that Reddit discourse supports such projections for arbitrary survey questions.

In future work, as models improve in capability and efficiency, the same methodology could be applied to track arbitrary climate-change-related questions across large-scale textual or video data. This would extend the dashboard's approach beyond Reddit to other platforms and modalities, enabling broader monitoring of public discourse on low-carbon transitions.

References

F. Creutzig, J. Roy, P. Devine-Wright, J. Diaz-Jose, F.W. Geels, A. Grubler, N. Maizi, E. Masanet, Y. Mulugetta, C.D. Onyige, P.E. Perkins, A. Sanches-Pereira, and E.U. Weber, “Demand, services and social aspects of mitigation,” in Climate Change 2022: Mitigation of Climate Change. IPCC AR6 WG3, P.R. Shukla et al., Eds. Cambridge University Press, pp. 503–612, 2022. doi:10.1017/9781009157926.007
R. Cialdini, R. Reno, and C. Kallgren, “A focus theory of normative conduct: Recycling the concept of norms to reduce littering in public places,” J. Personality and Social Psychology, vol. 58, pp. 1015–1026, 1990.
D. Miller and D. Prentice, “Changing norms to change behavior,” Annual Review of Psychology, vol. 67, pp. 339–361, 2016.
J. Bonan, C. Cattaneo, G. d’Adda, and M. Tavoni, “The interaction of descriptive and injunctive social norms in promoting energy conservation,” Nature Energy, vol. 5, pp. 900–909, 2020.
W. Abrahamse and L. Steg, “Social influence approaches to encourage resource conservation: A meta-analysis,” Global Environmental Change, vol. 23, pp. 1773–1785, 2013.
J. Jachimowicz et al., “The critical role of second-order normative beliefs in predicting energy conservation,” Nature Human Behaviour, vol. 2, pp. 757–764, 2018.
C. Mortensen et al., “Trending norms: A lever for encouraging behaviors performed by the minority,” Social Psychological and Personality Science, vol. 10, pp. 201–210, 2019.
A. Tyson and B. Kennedy, “How Americans View National, Local and Personal Energy Choices,” Pew Research Center, June 27, 2024; see also S. Bestvater and S. Shah, “Electric Vehicle Charging Infrastructure in the U.S.,” Pew Research Center, May 23, 2024.
Yale Program on Climate Change Communication, “Yale Climate Opinion Maps,” 2023.
A. Leiserowitz, M. Ballew, S. Rosenthal, and J. Semaan, “Climate Change and the American Diet,” Yale University and Earth Day Network, New Haven, CT: Yale Program on Climate Change Communication, Feb. 13, 2020.
B. Sigrin, T. Dietz, A. Henry, A. Ingle, L. Lutzenhiser, M. Moezzi, S. Spielman, P. Stern, A. Todd, J. Tong, and K. Wolske, “Understanding the Evolution of Customer Motivations and Adoption Barriers in Residential Solar Markets: Survey Data,” NREL Data Catalog, Golden, CO: National Renewable Energy Laboratory, 2017. doi:10.7799/1362095.
W. Pearce et al., “Climate change on Twitter: topics, communities and conversations about the 2013 IPCC Working Group 1 report,” PLOS ONE, vol. 9, no. 4, e94785, 2014.
C. Sunstein, How Change Happens, MIT Press, 2019.
H. Pettifor, M. Agnew, and C. Wilson, “A framework for measuring and modelling low-carbon lifestyles,” Global Environmental Change, vol. 82, 102739, 2023. doi:10.1016/j.gloenvcha.2023.102739
Climact, “Net Zero by 2050: from whether to how,” technical report, 2018.
S. Barr and J. Prillwitz, “A stronger sustainability?” Geoforum, vol. 52, pp. 1–10, 2014.
L. Steg, “Car use: lust and must. Instrumental, symbolic and affective motives for car use,” Transportation Research Part A, vol. 39, pp. 147–162, 2005.
E.H. Noppers, K. Keizer, J.W. Bolderdijk, and L. Steg, “The adoption of sustainable innovations: driven by symbolic and environmental motives,” Global Environmental Change, vol. 25, pp. 52–62, 2014.
J. Smith, R.A. Butler, and L. Gibbs, “Gathering around stories: The role of storytelling in climate change adaptation,” Geography Compass, vol. 11, e12316, 2017.
L. Lutzenhiser, “Social and behavioral aspects of energy use,” Annual Review of Energy and the Environment, vol. 18, pp. 247–289, 1993.
P.C. Stern, E. Aronson, J.M. Darley, D.H. Hill, E. Hirst, W. Kempton, and T.J. Wilbanks, “The effectiveness of incentives for residential energy conservation,” Evaluation Review, vol. 9, pp. 147–176, 1985.
R. Friedmann and C. Sheinbaum, “Implementing renewable energy technologies in Mexico: public/private partnerships,” Energy Policy, vol. 26, pp. 1135–1143, 1998.
W. Pearce, S. Niederer, S. O'Neill, and M. Hurlstone, “Climate change on social media: the role of platforms in public engagement,” Wiley Interdisciplinary Reviews: Climate Change, vol. 10, e604, 2019.

CLIMATE DISCOURSE ON REDDIT - LOW-AI Dashboard
