
This article explores the synergy between web scraping techniques and artificial intelligence in extracting, processing, and analyzing large-scale online data. By combining traditional scraping tools with machine learning models, such as transformers and named entity recognition systems, the study demonstrates how raw web data can be transformed into actionable insights. The proposed pipeline was tested on real-world datasets, including news sites and product reviews. Results show significant improvements in data quality, classification accuracy, and analysis speed. This research offers a scalable framework for AI-powered online data mining.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
