Downloads provided by UsageCounts
Social media is a rich source of assertions about personal attributes, such as "I am a doctor" or "my hobby is playing tennis." Precisely identifying explicit assertions is difficult, though, because users' vocabulary and phrasing vary widely. Identifying implicit assertions like "I've been at work treating patients all day" is even more challenging. We present RedDust, a data resource consisting of personal attribute labels for over 300k Reddit users across five predicates: profession, hobby, family status, age, and gender. We construct RedDust using a diverse set of high-precision patterns. To the best of our knowledge, RedDust is the first semantic data resource about Reddit users at large scale. We envision further use cases of RedDust for providing background knowledge about user traits, to enhance personalized search and recommendation as well as conversational agents.
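To make the distinction between explicit and implicit assertions concrete, here is a minimal sketch of pattern-based extraction. The patterns, the `PATTERNS` table, and the `extract_attributes` helper are hypothetical and chosen only for illustration; they are not the patterns used to build RedDust.

```python
import re

# Hypothetical high-precision patterns, for illustration only.
# They are NOT the patterns used to construct RedDust.
PATTERNS = {
    "profession": re.compile(
        r"\bI(?: am|'m) an? (doctor|nurse|teacher|engineer)\b", re.IGNORECASE
    ),
    "hobby": re.compile(r"\bmy hobby is (\w+(?: \w+)?)", re.IGNORECASE),
}


def extract_attributes(post):
    """Return (predicate, value) pairs for explicit assertions found in a post."""
    found = []
    for predicate, pattern in PATTERNS.items():
        for match in pattern.finditer(post):
            found.append((predicate, match.group(1).lower()))
    return found


explicit = extract_attributes("I am a doctor and my hobby is playing tennis.")
implicit = extract_attributes("I've been at work treating patients all day")
```

In this sketch the explicit post yields a profession and a hobby label, while the implicit post yields nothing, which is why implicit assertions call for learned models rather than surface patterns.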
reddit, user attributes
| Indicator | Description | Value |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 9 |
| downloads | | 52 |

Views provided by UsageCounts