Downloads provided by UsageCounts
Sentiment analysis, also known as opinion mining, plays a big role in both private and public sector Business Intelligence (BI); it attempts to improve public and customer experience. Nevertheless, de-identified sentiment scores from public social media posts can compromise individual privacy due to their vulnerability to record linkage attacks. Established privacy-preserving methods like k-anonymity, l-diversity and t-closeness are offline models exclusively designed for data at rest. Recently, a number of online anonymization algorithms (CASTLE, SKY, SWAF) have been proposed to complement the functional requirements of streaming applications, but without open-source implementation. In this paper, we present a reusable Apache NiFi dataflow that buffers tweets from multiple edge devices and performs anonymized sentiment analysis in real-time, using randomization. The solution can be easily adapted to suit different scenarios, enabling researchers to deploy custom anonymization algorithms.
Abhinay Pandya, Panos Kostakos, Hassan Mahmood, Marta Cortes, Ekaterina Gilman, Mourad Oussalah and Susanna Pirttikangas, "Privacy preserving sentiment analysis on multiple edge data streams with Apache NiFi", in Proc. European Intelligence and Security Informatics Conference (EISIC 2019), Oulu, Finland, November 2019. (DOI: https://doi.org/10.1109/EISIC49498.2019.9108851)
Apache NiFi, social media, sentiment analysis, IoT privacy, anonymization
Apache NiFi, social media, sentiment analysis, IoT privacy, anonymization
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 10 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 8 | |
| downloads | 32 |

Views provided by UsageCounts
Downloads provided by UsageCounts