
This dataset contains 524,406 individual professional credits from 12,820 episodes of Italian TV scripted series broadcast between 2000 and 2023. Each row represents a single professional contribution (e.g., one person working on one episode in one role). Data was extracted from IMDb in April 2024 for the study "Women Off-Screen: A Data-Driven Analysis of Gender Inequalities in Italian TV Serial Production (2000-2023)." The dataset covers credits from 25,538 unique professionals across cast and crew roles. VARIABLES: professional_name: Full name of the professional as listed on IMDb; professional_code: Unique IMDb identifier for the person (nconst format, e.g., nm0188743), which can be used to access their page at https://www.imdb.com/name/[code]; professional_role: The specific job role/position held (e.g., "writer", "director", "actor", "cinematographer"); TV_series_title: Title of the TV series; TV_series_code: IMDb identifier for the series (tconst format, e.g., tt0086779), which can be used to access the series page at https://www.imdb.com/title/[code]); episode_title: Title of the specific episode; episode_code: IMDb identifier for the episode (tconst format, e.g., tt0086779), which can be used to access the episodes page at https://www.imdb.com/title/[code]); episode_year: Year in which the episode aired; NOTES: This is an episode-level dataset: if a professional worked on multiple episodes in the same or different roles, they appear multiple times (one row per episode per role). To obtain unique professionals count, aggregate by professional_code. To calculate workload per professional, count the number of episodes (distinct episode_code) per professional_code. The 12,820 episodes represent 49.12% of the total 26,101 episodes aired during 2000-2023 (see related "IMDb-based Italian TV Serial Production (2000-2023): Series Metadata and Episode Coverage" dataset for complete series catalog). FILE FORMAT: CSV (Comma-Separated Values), UTF-8 encoding. The file can be opened with spreadsheet software (Excel, LibreOffice Calc, Google Sheets) or imported into statistical analysis software (R, Python, SPSS, Stata).
