
We present a new practical algorithm to resolve the experimental data of restriction site analysis, which is a common technique for mapping DNA. Specifically, we assert that multiple digests with a single restriction enzyme can provide sufficient information to identify the positions of the restriction sites with high probability. The motivation for the new approach comes from combinatorial results on the number of mutually homeometric sets in one dimension, where two sets of n points are homeometric if the multiset of (n2) distances they determine are the same. Since experimental data contains error, we propose algorithms for reconstructing sets from noisy interpoint distances, including the possibility of missing fragments. We analyze the performance of these algorithms under a reasonable probability distribution, establishing a relative error limit of r = theta (1/n2) beyond which our technique becomes infeasible. Through simulations, we establish that our technique is robust enough to reconstruct data with relative errors of up to 7.0% in the measured fragment lengths for typical problems, which appears sufficient for certain biological applications.
number of mutually homeometric sets, Computing methodologies and applications, algorithm, multiple digestions, Base Sequence, Biochemistry, molecular biology, restriction site analysis, noisy interpoint distances, Restriction Mapping, missing fragments, Combinatorics in computer science, DNA, DNA Restriction Enzymes, Sequence Analysis, DNA, Protein sequences, DNA sequences, Bacteriophage lambda, relative error limit, Substrate Specificity, Discrete mathematics in relation to computer science, simulations, Computational methods for problems pertaining to biology, Enumerative combinatorics, Mathematics, Algorithms, single restriction enzyme
number of mutually homeometric sets, Computing methodologies and applications, algorithm, multiple digestions, Base Sequence, Biochemistry, molecular biology, restriction site analysis, noisy interpoint distances, Restriction Mapping, missing fragments, Combinatorics in computer science, DNA, DNA Restriction Enzymes, Sequence Analysis, DNA, Protein sequences, DNA sequences, Bacteriophage lambda, relative error limit, Substrate Specificity, Discrete mathematics in relation to computer science, simulations, Computational methods for problems pertaining to biology, Enumerative combinatorics, Mathematics, Algorithms, single restriction enzyme
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 46 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
