
doi: 10.1109/wkdd.2010.50
With the rapid growth of the SMS, the filtration to all messages has been unable to meet the real-time processing requirement. In this paper, we propose a sampling of mass SMS filtering algorithm based on frequent time-domain area to solve this problem. First, we collect the long-running system log. And then analyze the time and domain features of the messages to generate the time-domain strategy. Finally we predict the potential spam messages’ rate in different domain and different time, and carries on the filtration according to each rate separately. This algorithm can satisfy the real-time filtration requirement of the mass SMS stream, and meanwhile there is no significant reduction in spam.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
