Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Eco-Amazon: Enriching E-commerce Datasets with Product Carbon Footprint for Sustainable Recommendations

Authors: Spillo, Giuseppe; De Filippo, Allegra; Musto, Cataldo; Milano, Michela; Semeraro, Giovanni;

Eco-Amazon: Enriching E-commerce Datasets with Product Carbon Footprint for Sustainable Recommendations

Abstract

This repository contains the Amazon datasets enriched with Product Carbon Footprint (PCF). Such dataset have been obtained by prompting state-of-the-art Large Language Models (LLMs) for estimating the PCF of Amazon products, in the Clothing, Electronics, Home & Kitchen domains. In particular, we exploit Google Gemini 2.5 Flash and OpenAI o3-mini to infer CO2e emissions based on product metadata, following strictly defined Life Cycle Assessment (LCA) standards (GHG Protocol, ISO 14040/14044). More details about the way these datasets have been enriched can be found in the associated paper and our repository for the source code. Metric Electronics Home & Kitchen Clothing Total Users 21,751 66,810 97,608 Total Items 11,495 17,027 21,380 Total Ratings 464,464 684,651 1,070,586 We provide these datasets in two forms: item metadata enriched with PCF estimations, one per LLM, in json format. For example, these are the Clothing datasets enriched with PCF estimations provided by Gemini-2.5-flash and GPT-o3-mini: clothing_gemini.jsonlclothing_o3mini.jsonl datasets in the RecBole format, used in our use case, whose code can be found in our GitHub Repository. As an example, the Amazon Clothing dataset in the RecBole format is the following: amazon_clothing.zip

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average