
This paper presents a collection of algorithms addressing two related problems in web data management: automatic wrapper generation for structured data extraction and efficient compression of large-scale graph structures. For the extraction problem, we develop a method for on-the-fly wrapper creation that leverages XPath expressions ranked by their discriminative power over HTML and XML document collections. For the compression problem, we propose techniques for reducing the storage requirements of adjacency list representations, with particular focus on the structural properties exhibited by web graphs and social networks. Additionally, we investigate the relationship between maximum flow and minimum cut in capacitated networks, presenting bounds on the max-flow min-cut gap and approximation algorithms for the multicut problem. Finally, we address fault tolerance in mesh-connected architectures through deep emulations that enable a fault-free mesh to be simulated on a mesh containing random faults.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
