
doi: 10.26108/yzrd-5p13
'Information Filtering' is the problem of extracting data we desire from a corpus of data. The task of 'filtering' depends heavily on the format of that corpus. For example, the corpus may be a stationary relational database, flat-file, collection of files, or, in the case of a news feed, may be continually 'streamed' to clients. In this research, we examine the problem of filtering data from a continual ' stream' that has been compressed with an 'online', XML-conscious compressor. We introduce a filtering system that leverages the format of compressed XML streams to provide 'subscription-oriented ' filtering of the data contained in the stream. Additionally, the system provides 'persistence' by efficiently storing the ' compressed' results in relational tables, allowing subscribers to received filtered content even if they are disconnected when filtering occurs. These features make our system useful in applications where XML data is continually streamed such as news-feeds, scientific document notification services or in XML routing applications.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
