
doi: 10.5120/7869-1145
In the latest years many different models for semi structured data have been proposed; most of them, however, are too specific to allow immediate comparison with other models, and do not easily support incremental model design. A number of features which can be considered interesting for a semi structured data model are listed. E.g. we review the more widely used models in Conceptual Modeling for Information Systems (Entity-Relationship and UML), and argue that they do not support effectively modeling of semi structured data. As a consequence, structured and semi structured data cannot be treated in an integrated, holistic way during requirements specification. Research about similarity between semi structured documents (particularly for XML documents) has produced many works in the areas of Database Systems, Artificial Intelligence and Data Mining. In this work introducing a brief survey about it. At first introducing some basic properties. After that, some works are reviewed, highlighting their particularities and general approach. Concluding with a comparison of these works, analyzing their benefits and problems.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
