
This README.txt file was generated on by

------------------GENERAL INFORMATION-------------------

Title of Dataset:

Author Information
Principal Investigator Project PID2021-123983OB-I00 [NON-CONSPIRA-HATE!]: Estrella Gualda, Universidad de Huelva, ESEIS/COIDESO/CISCOA-Lab, Facultad de Trabajo Social, Avda. Tres de Marzo, s/n, 21007-Huelva, estrella@uhu.es, ORCID: http://orcid.org/0000-0003-0220-2135

Date and description of data collection:

Funding:

General description:

Keywords:

--------------------------SHARING/ACCESS INFORMATION--------------------------

Data availability:

Data Request Form:

Rights and permissions:

Citation to publications that cite or use the data:

--------------------DATA & FILE OVERVIEW--------------------

File list:

Total size:

--------------------------METHODOLOGICAL INFORMATION--------------------------

Data collection:

--------------------------DATA-SPECIFIC INFORMATION: TWEET DATA JSON STRUCTURE AND EXAMPLE--------------------------

[{
  "_id": { "$oid": "63e4c1beb82b43c83e3ce873" },
  "author_id": "702461646204243968",
  "possibly_sensitive": false,
  "created_at": "2022-12-31T23:58:14.000Z",
  "edit_controls": { "edits_remaining": 5, "is_edit_eligible": false, "editable_until": "2023-01-01T00:28:14.000Z" },
  "text": "@Dandastur Yo diría que no, antes han puesto con la orquesta gente joven (cito de memoria, algo de gracia perderá)\n\"Han ido posponiendo el cambio de nombre, como el PSOE con la ley trans\"\n\"Bueno, como el PSOE en general\"",
  "public_metrics": { "retweet_count": 0, "reply_count": 0, "like_count": 1, "quote_count": 0, "impression_count": 44 },
  "in_reply_to_user_id": "259734564",
  "id": "1609338172235825153",
  "conversation_id": "1609336914603298816",
  "edit_history_tweet_ids": [ "1609338172235825153" ],
  "entities": { "mentions": [ { "start": 0, "end": 10, "username": "Dandastur", "id": "259734564", "created_at": "2011-03-02T13:56:02.000Z", "pinned_tweet_id": "728862357594750976", "entities": { "url": { "urls": [ { "start": 0, "end": 22, "url": "http://t.co/WemORp7KAf", "expanded_url": "http://www.jugadoresdefortuna.com", "display_url": "jugadoresdefortuna.com" } ] }, "description": { "urls": [ { "start": 112, "end": 135, "url": "https://t.co/c7TFPHBAEq", "expanded_url": "https://play.spotify.com/album/7ops7tph6jMMwsjVejFxbw", "display_url": "play.spotify.com/album/7ops7tph…" } ] } }, "location": "Entre Frankfurt y Asturias", "protected": false, "name": "Danda", "profile_image_url": "https://pbs.twimg.com/profile_images/997205486272311296/LkS3sRzU_normal.jpg", "description": "Traductor de series y videojuegos. Izquierdista pesado. En internet desde 1996. Fui batería de rock progresivo: https://t.co/c7TFPHBAEq Aviso: RETUITEO MUCHO", "url": "http://t.co/WemORp7KAf", "verified": false, "public_metrics": { "followers_count": 2782, "following_count": 922, "tweet_count": 669895, "listed_count": 156 } } ], "annotations": [ { "start": 165, "end": 168, "probability": 0.9728, "type": "Organization", "normalized_text": "PSOE" }, { "start": 204, "end": 207, "probability": 0.9782, "type": "Organization", "normalized_text": "PSOE" } ] },
  "context_annotations": [ { "domain": { "id": "131", "name": "Unified Twitter Taxonomy", "description": "A taxonomy of user interests. " }, "entity": { "id": "847878884917886977", "name": "Politics", "description": "Politics" } }, { "domain": { "id": "131", "name": "Unified Twitter Taxonomy", "description": "A taxonomy of user interests. " }, "entity": { "id": "1516798152279425028", "name": "Spain politics" } } ],
  "lang": "es",
  "referenced_tweets": [ { "type": "replied_to", "id": "1609336914603298816", "author_id": "259734564", "possibly_sensitive": false, "created_at": "2022-12-31T23:53:15.000Z", "edit_controls": { "edits_remaining": 5, "is_edit_eligible": true, "editable_until": "2023-01-01T00:23:15.000Z" }, "text": "Por favor, que alguien me confirme que la TVE facha actual NO le ha pasado la censura y el filtro fachificador a los chistes de este año, veo muchos datos y pocos chistes. Me encantaban las hostias que le repartían a Rivera antes de que se quitase de en medio. https://t.co/4TllwALWqb", "public_metrics": { "retweet_count": 0, "reply_count": 2, "like_count": 0, "quote_count": 0, "impression_count": 308 }, "conversation_id": "1609336914603298816", "edit_history_tweet_ids": [ "1609336914603298816" ], "entities": { "urls": [ { "start": 261, "end": 284, "url": "https://t.co/4TllwALWqb", "expanded_url": "https://twitter.com/c_3peor/status/1609333955156639744", "display_url": "twitter.com/c_3peor/status…" } ], "annotations": [ { "start": 42, "end": 44, "probability": 0.519, "type": "Organization", "normalized_text": "TVE" }, { "start": 217, "end": 222, "probability": 0.853, "type": "Person", "normalized_text": "Rivera" } ] }, "lang": "es", "referenced_tweets": [ { "type": "quoted", "id": "1609333955156639744" } ], "reply_settings": "everyone", "author": { "created_at": "2011-03-02T13:56:02.000Z", "id": "259734564", "pinned_tweet_id": "728862357594750976", "entities": { "url": { "urls": [ { "start": 0, "end": 22, "url": "http://t.co/WemORp7KAf", "expanded_url": "http://www.jugadoresdefortuna.com", "display_url": "jugadoresdefortuna.com" } ] }, "description": { "urls": [ { "start": 112, "end": 135, "url": "https://t.co/c7TFPHBAEq", "expanded_url": "https://play.spotify.com/album/7ops7tph6jMMwsjVejFxbw", "display_url": "play.spotify.com/album/7ops7tph…" } ] } }, "location": "Entre Frankfurt y Asturias", "protected": false, "username": "Dandastur", "name": "Danda", "profile_image_url": "https://pbs.twimg.com/profile_images/997205486272311296/LkS3sRzU_normal.jpg", "description": "Traductor de series y videojuegos. Izquierdista pesado. En internet desde 1996. Fui batería de rock progresivo: https://t.co/c7TFPHBAEq Aviso: RETUITEO MUCHO", "url": "http://t.co/WemORp7KAf", "verified": false, "public_metrics": { "followers_count": 2782, "following_count": 922, "tweet_count": 669895, "listed_count": 156 } } } ],
  "reply_settings": "everyone",
  "author": { "created_at": "2016-02-24T11:54:42.000Z", "id": "702461646204243968", "protected": false, "username": "echidnamoroso", "name": "Echidnamoroso", "profile_image_url": "https://pbs.twimg.com/profile_images/747020599445295104/y9Z5F1wk_normal.jpg", "description": "", "verified": false, "public_metrics": { "followers_count": 42, "following_count": 248, "tweet_count": 2170, "listed_count": 0 } },
  "in_reply_to_user": { "created_at": "2011-03-02T13:56:02.000Z", "id": "259734564", "pinned_tweet_id": "728862357594750976", "entities": { "url": { "urls": [ { "start": 0, "end": 22, "url": "http://t.co/WemORp7KAf", "expanded_url": "http://www.jugadoresdefortuna.com", "display_url": "jugadoresdefortuna.com" } ] }, "description": { "urls": [ { "start": 112, "end": 135, "url": "https://t.co/c7TFPHBAEq", "expanded_url": "https://play.spotify.com/album/7ops7tph6jMMwsjVejFxbw", "display_url": "play.spotify.com/album/7ops7tph…" } ] } }, "location": "Entre Frankfurt y Asturias", "protected": false, "username": "Dandastur", "name": "Danda", "profile_image_url": "https://pbs.twimg.com/profile_images/997205486272311296/LkS3sRzU_normal.jpg", "description": "Traductor de series y videojuegos. Izquierdista pesado. En internet desde 1996. Fui batería de rock progresivo: https://t.co/c7TFPHBAEq Aviso: RETUITEO MUCHO", "url": "http://t.co/WemORp7KAf", "verified": false, "public_metrics": { "followers_count": 2782, "following_count": 922, "tweet_count": 669895, "listed_count": 156 } }
}]

Ed Summers, Igor Brigadir, Sam Hames, Hugo van Kemenade, Peter Binkley, tinafigueroa, Nick Ruest, Walmir, Dan Chudnov, David Thiel, Betsy, Ryan Chartier, celeste, Hause Lin, Alice, Andy Chosak, Mirko Lenz, R. Miles McCain, Ian Milligan, Andreas Segerberg, Daniyal Shahrokhian, Melanie Walsh, Leonard Lausen, Nicholas Woodward, eggplants, Ashwin Ramaswami, Boyd Nguyen, Darío Hereñú, Dmitrijs Milajevs, and Frederik Elwert (2023). Docnow/twarc: v2.14.0. Zenodo [Computer Software]. https://doi.org/10.5281/zenodo.7799050

More information on twarc in:
- Twarc2. https://twarc-project.readthedocs.io/en/latest/twarc2_en_us/
- GitHub. https://github.com/DocNow/twarc

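
The tweet structure shown above can be read with the Python standard library alone. The following is a minimal sketch, assuming the data file stores one tweet object per line (JSON Lines); the function names `iter_tweets` and `summarize`, the choice of fields, and the abbreviated sample record are illustrative and not part of the dataset documentation.

```python
import json
from typing import Iterable, Iterator


def iter_tweets(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one tweet dict per non-empty line of JSON Lines input."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def summarize(tweet: dict) -> dict:
    """Pick a few fields from a tweet record, tolerating missing keys."""
    metrics = tweet.get("public_metrics", {})
    author = tweet.get("author", {})
    entities = tweet.get("entities", {})
    return {
        "id": tweet.get("id"),
        "created_at": tweet.get("created_at"),
        "lang": tweet.get("lang"),
        "username": author.get("username"),
        "like_count": metrics.get("like_count", 0),
        "mentions": [m.get("username") for m in entities.get("mentions", [])],
        "reply_to": [
            ref.get("id")
            for ref in tweet.get("referenced_tweets", [])
            if ref.get("type") == "replied_to"
        ],
    }


# Abbreviated, hypothetical line built from values in the example record.
SAMPLE_LINE = json.dumps({
    "id": "1609338172235825153",
    "created_at": "2022-12-31T23:58:14.000Z",
    "lang": "es",
    "public_metrics": {"like_count": 1, "retweet_count": 0},
    "author": {"username": "echidnamoroso"},
    "entities": {"mentions": [{"username": "Dandastur"}]},
    "referenced_tweets": [{"type": "replied_to", "id": "1609336914603298816"}],
})

rows = [summarize(t) for t in iter_tweets([SAMPLE_LINE, ""])]
print(rows[0])
```

To process a real file, an open file handle can be passed instead of the list, for example `iter_tweets(open("tweets.jsonl", encoding="utf-8"))`, where the file name is a placeholder.
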
The LGBTQI+ Dataset 2020-2022_es is a collection of 410,015 original tweets extracted from the social network Twitter between January 1, 2020, and December 31, 2022. To ensure data quality and relevance, retweets, replies, and other duplicate content were excluded, retaining only original tweets. The tweets were collected by Jacinto Mata (University of Huelva, I2C/CITES) using the Python programming language, the twarc2 tool, and the Twitter Academic API v2.

This data collection is part of the project “Conspiracy Theories and Hate Speech Online: Comparison of patterns in narratives and social networks about COVID-19, immigrants and refugees and LGBTI people [NON-CONSPIRA-HATE!]”, PID2021-123983OB-I00, funded by MCIN/AEI/10.13039/501100011033/ by FEDER/EU. The search criteria (words and hashtags) used for the data collection followed the objectives of the aforementioned project and were defined by Estrella Gualda, Francisco Javier Santos Fernández and Jacinto Mata (University of Huelva, Spain).

The terms and hashtags used for the search and extraction of tweets were: #orgullogay, #orgullotrans, #OrgulloLGTB, #OrgulloLGTBI, #Díadelorgullo, #TRANSFOBIA, #transexuales, #LGTB, #LGTBI, #LGTBIQ, #LGTBQ, #LGTBQ+, anti-gay, "anti gay", anti-trans, "anti trans", "Ley Anti-LGTB", "ley trans", "anti-ley trans".

This dataset, collected within the framework of the NON-CONSPIRA-HATE! project, aims to identify and map online hate speech narratives and conspiracy theories targeting LGBTIQ+ people and the LGBTIQ+ community. It is also intended to support the comparison of communication patterns on social media (rhetoric, language, micro-discourses, semantic networks, emotions, etc.) across the different datasets collected in this project. Finally, it contributes to mapping the actors, communities, and networks that spread hate messages and conspiracy theories, in order to understand the patterns and strategies used by extremist sectors on social media.

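
As an illustration of how the search terms listed above could be combined into a single Twitter API v2 search query, here is a minimal sketch. The exact query used for this dataset is not documented here, so the following are assumptions: the helper names `quote_term` and `build_query`, the rule of quoting multi-word and hyphenated terms as phrases, the `lang:es` restriction, and the use of the `-is:retweet` and `-is:reply` operators to keep only original tweets.

```python
from typing import Iterable

# Terms and hashtags as listed in the dataset description.
TERMS = [
    "#orgullogay", "#orgullotrans", "#OrgulloLGTB", "#OrgulloLGTBI",
    "#Díadelorgullo", "#TRANSFOBIA", "#transexuales", "#LGTB", "#LGTBI",
    "#LGTBIQ", "#LGTBQ", "#LGTBQ+", "anti-gay", "anti gay", "anti-trans",
    "anti trans", "Ley Anti-LGTB", "ley trans", "anti-ley trans",
]


def quote_term(term: str) -> str:
    """Quote multi-word or hyphenated terms so they match as exact phrases."""
    if term.startswith("#"):
        return term
    if " " in term or "-" in term:
        return f'"{term}"'
    return term


def build_query(terms: Iterable[str], lang: str = "es",
                originals_only: bool = True) -> str:
    """Join terms with OR and append language and tweet-type filters."""
    # dict.fromkeys removes duplicates while preserving the original order.
    unique = list(dict.fromkeys(quote_term(t) for t in terms))
    query = "(" + " OR ".join(unique) + ")"
    if lang:
        query += f" lang:{lang}"
    if originals_only:
        # Assumed filters: exclude retweets and replies.
        query += " -is:retweet -is:reply"
    return query


query = build_query(TERMS)
print(query)
```

A string built this way could then be passed to the twarc2 command line, for example `twarc2 search --archive --start-time 2020-01-01 --end-time 2023-01-01 "<query>" tweets.jsonl`, where the output file name and the date bounds are placeholders chosen to cover the collection period.
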
The dataset includes messages that address a wide range of topics related to the LGBTQI+ community, such as rights, visibility, the fight against discrimination and transphobia, as well as debates surrounding the Trans Law and other related issues. It includes expressions of support and celebration of Pride as well as hate speech and opposition to LGBTQI+ rights, along with debates and controversies surrounding these issues.

This dataset offers a wide range of possibilities for research in various disciplines, as the following examples illustrate:

Social Sciences & Digital Humanities:
- Analysis of opinions, attitudes, and trends toward LGBTIQ+ people and the LGBTIQ+ community.
- Studies on the evolution of public discourse and polarization around issues such as transphobia, hate speech, disinformation, LGBTIQ+ rights and pride, and others.
- Analysis of social and political actors, leaders, or organizations disseminating diverse narratives on LGBTIQ+ issues.
- Research on the impact of specific events (e.g., Pride Day) on social media conversations.
- Investigations of social and semantic networks around LGBTIQ+ people and the LGBTIQ+ community.
- Analysis of narratives, discourses, and rhetoric around gender identity and sexual diversity.
- Comparative studies on the representation of LGBTIQ+ people and the LGBTIQ+ community in different cultural or geographic contexts.

Computer Science and Artificial Intelligence:
- Development of algorithms for the automatic detection of hate speech, discriminatory language, or offensive content.
- Training natural language processing (NLP) models to analyze sentiments and emotions in texts related to LGBTIQ+ people and the LGBTIQ+ community.

For more information on other technical details of the dataset and the structure of the .jsonl data, see the “Readme.txt” file.
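
As a small illustration of the event-related analyses mentioned above (for example, activity around Pride Day), tweets can be counted per calendar month using the `created_at` field. This is only a sketch: the helper names `month_key` and `tweets_per_month` are hypothetical, and the timestamp format is assumed to match the example record ("2022-12-31T23:58:14.000Z").

```python
from collections import Counter
from datetime import datetime
from typing import Iterable


def month_key(created_at: str) -> str:
    """Convert a timestamp such as '2022-12-31T23:58:14.000Z' to '2022-12'."""
    parsed = datetime.strptime(created_at, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.strftime("%Y-%m")


def tweets_per_month(tweets: Iterable[dict]) -> Counter:
    """Count tweets per month, skipping records without a timestamp."""
    return Counter(
        month_key(t["created_at"]) for t in tweets if "created_at" in t
    )


# Hypothetical minimal records; real records carry many more fields.
sample = [
    {"created_at": "2022-12-31T23:58:14.000Z"},
    {"created_at": "2022-12-31T23:53:15.000Z"},
    {"created_at": "2022-06-28T10:00:00.000Z"},
    {},
]
counts = tweets_per_month(sample)
for month, n in sorted(counts.items()):
    print(month, n)
```
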
Lesbian, Twitter, Other Genders, Intersex Persons, Transgender Persons, Conspiracy Theories, Online Hate Speech, LGBTIQ+, Gay, Computational Sociology, Queer/Questioning, Computational Social Science, Disinformation, Bisexual, Anti-Trans law, Social Media, Transphobia
Twitter Data
