Downloads provided by UsageCounts
The usage of LeetSpeak and other text hiding tricks is often used by spammers in the distribution of unsolicited contents. To evaluate deobfuscation techniques and their impact on spam content classification, we preprocessed several popular public datasets to partially obfuscate the text. The datasets transformed are: YouTube Spam Collection [2, 3] which is available on https://www.dt.fee.unicamp.br/~tiago/youtubespamcollection/. a subset of YouTube Comments [4, 5] which is available on http://mlg.ucd.ie/yt/. CSDMC2010 which is available on http://csmining.org/index.php/spam-email-datasets-.html. TREC2007 which is available on https://plg.uwaterloo.ca/~gvcormac/treccorpus07/
Leetspeak, Text Deobfuscation, Spam filtering
Leetspeak, Text Deobfuscation, Spam filtering
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 21 | |
| downloads | 5 |

Views provided by UsageCounts
Downloads provided by UsageCounts