Name: Efficient Effective Duplicate Detection Hierarchical Data-Myprojectbazaar
Brand: Datamining
SKU: PROJ1376
Price: 4500.00 INR
Availability: InStock

Product Description

Efficient and Effective Duplicate Detection in Hierarchical Data

Abstract— Efficient and Effective Duplicate Detection in Hierarchical Data. Although there is a long line of work on identifying duplicates in relational data, only a few solutions focus on duplicate detection in more complex hierarchical structures, like XML data. In this paper, we present a novel method for XML duplicate detection, < Final Year Projects > called XMLDup. XMLDup uses a Bayesian network to determine the probability of two XML elements being duplicates, considering not only the information within the elements, but also the way that information is structured. In addition, to improve the efficiency of the network evaluation, a novel pruning strategy, capable of significant gains over the unoptimized version of the algorithm, is presented. Through experiments, we show that our algorithm is able to achieve high precision and recall scores in several data sets. XMLDup is also able to outperform another state-of-the-art duplicate detection solution, both in terms of efficiency and of effectiveness.

Video

View Demo

Including Packages

Our Specialization

Support Service

CUSTOMER SUPPORT

Call us +91 967-778-1155

HAPPY CUSTOMERS

Read the testimonials

LATEST NEWS

enjoy our blog

Statistical Report

satisfied customers

3,589

Freelance projects

983

sales on Site

11,021

developers

175+

Additional Information

Domains	Datamining
Programming Language	Dotnet

Cart

CUSTOMER SUPPORT

Efficient and Effective Duplicate Detection in Hierarchical Data

Product Description

Including Packages

Our Specialization

Support Service

CUSTOMER SUPPORT

Call us +91 967-778-1155

HAPPY CUSTOMERS

Read the testimonials

LATEST NEWS

enjoy our blog

Statistical Report

Additional Information

Related products

Mining User Queries with Markov Chains: Application to Online Image Retrieval

Share on:

A Probabilistic Misbehavior Detection Scheme towards Efficient Trust Establishment in Delay-tolerant Networks

Share on:

Automatic Semantic Content Extraction in Videos Using a Fuzzy Ontology and Rule-Based Model

Share on:

Share on: