Benvenuto in Scribd!

Assign CS614 3 FAll 2011 Sol

Caricato da

Il 0% ha trovato utile questo documento (0 voti)

11 visualizzazioni3 pagine

FUZZY-FINGERPRINTING and LOCALITY-SENSITIVE hashing are used in this paper to search the text. Which hashing technique out of these two is best in your point of view? Which task(s) is more suitable for a text based search retrieval? Provide reasons to support your answer.

Descrizione originale:

Copyright

Formati disponibili

DOC, PDF, TXT o leggi online da Scribd

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Segnala questo documento

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato DOC, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Il 0% ha trovato utile questo documento (0 voti)

11 visualizzazioni3 pagine

Assign CS614 3 FAll 2011 Sol

Caricato da

Asad Amanat

Copyright:

Attribution Non-Commercial (BY-NC)

Formati disponibili

Scarica in formato DOC, PDF, TXT o leggi online su Scribd

Segnala contenuti inappropriati

Salta alla pagina

Sei sulla pagina 1di 3

Cerca all'interno del documento

Assignment No.

03 SEMESTER FALL 2011 CS614- Data Warehousing Asad Amanat BC070400930 Marks: 20
Total Marks: 20 Due Date: 27/12/2011

Question 1: [ 13 marks ] How FUZZY-FINGERPRINTING and LOCALITY-SENSITIVE HASHING are used in this paper to search the text? Which hashing techniques out of these two is best in your point of view? Justify your answer with reasons. Solution:

The paper introduced quite 2 different technique construction for hash based indexing. These principles are driven from fuzzy finger printing and locality sensitive respectively. According to an analysis of both hashing approaches to show their applicability for near duplication task and similarity search task and then we compare in term of precision and recall. The result of our search says that fuzzy finger printing outperform the locality sensitive hashing in task of near duplicate detection. Within the similarity task fuzzy finger printing achieves a clearly higher precision as compared to locality sensitive. Actually the locality sensitive hashing technique is used to control the different kind of high-dimensional vector based object representation. While the Fuzzy Finger-print is used for other domain of interests. Stated that the object of this domain can be characterized with a small set of discriminative features. In my opinion fuzzy Finger printing is best because our search based on the theoretical analysis of similarity of hash function and then focus on the on the practical implementation. We want to quantify the relation between the determinants of the Fuzzy Finger printing and then achieved the retrieval performance in order to build the hash index for special purpose retrieval task. Thats why we apply Fuzzy Finger printing as a main and vital technology in our text based plagiarism analysis.
Question 2: [ 7 marks ] In this paper three fundamental text retrieval tasks where hash-based indexing can be applied are discussed that are: (i) grouping, (ii) similarity search and (iii) classification.

Which task(s) is more suitable for a text based search retrieval? Provide reasons to support your answer. Solution:

Three fundamental text retrieval task where hash based indexing can be applied are grouping similarity search and classification. These are used for text based retrieval task in hash based indexing. Grouping and classification are similar among their functionality. Both used for finding the large results that need to be refined and visually prepared and clean Rom duplicates. Classification is used with a small number of classes. Out of these three the similarity search is most suited text retrieval task that is based on term query. It has some key feature. Many large search engines like Google, yahoo and Alta Vista use satisfying these kinds of information needs. To identify the similar document appropriate key words are extracted from document query. A no. k of term query are formed from these key words and the respective result sets match with document query. In this way we text based retrieval performs. It can assume dramatic proportion. This situation is aggravated if an application like plagiarism analysis requires the segmentation of the query document in order to realize a similarity search for one paragraph at a time. It is not as suitable for document to document similarity search as for short user queries. Its improves the search quality and search efficiency too. On a large collection of document with a very small amount of data we can perform this technique. We can say that our scheme outperform the slandered textual similarity search on the inverted representation both in term of quality and efficiency.

Potrebbero piacerti anche

Re-Interview Information
Documento40 pagine
Re-Interview Information
Ahmad Allbab
Nessuna valutazione finora
Volume 2 Issue 6 2016 2020
Documento5 pagine
Volume 2 Issue 6 2016 2020
saman
Nessuna valutazione finora
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
Documento11 pagine
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
IAEME Publication
Nessuna valutazione finora
Measuring Semantic Similarity Between Words and Improving Word Similarity by Augumenting PMI
Documento5 pagine
Measuring Semantic Similarity Between Words and Improving Word Similarity by Augumenting PMI
International Journal of Application or Innovation in Engineering & Management
Nessuna valutazione finora
Csit 3101
Documento11 pagine
Csit 3101
saman
Nessuna valutazione finora
Clustering Algorithm PDF
Documento5 pagine
Clustering Algorithm PDF
isssac_Abraham
Nessuna valutazione finora
Hybrid Technique For Enhanced Searching of Text Documents: Volume 2, Issue 4, April 2013
Documento5 pagine
Hybrid Technique For Enhanced Searching of Text Documents: Volume 2, Issue 4, April 2013
International Journal of Application or Innovation in Engineering & Management
Nessuna valutazione finora
Iaetsd-Jaras-Comparative Analysis of Correlation and Pso
Documento6 pagine
Iaetsd-Jaras-Comparative Analysis of Correlation and Pso
iaetsdiaetsd
Nessuna valutazione finora
Expert Systems With Applications ResuMat
Documento14 pagine
Expert Systems With Applications ResuMat
Asma Benmassoud
Nessuna valutazione finora
Performance Evaluation of Query Processing Techniques in Information Retrieval
Documento6 pagine
Performance Evaluation of Query Processing Techniques in Information Retrieval
idescitation
Nessuna valutazione finora
A Secure and Dynamic Multi-Keyword Ranked Search Scheme Over Encrypted Cloud Data
Documento32 pagine
A Secure and Dynamic Multi-Keyword Ranked Search Scheme Over Encrypted Cloud Data
m.muthu lakshmi
Nessuna valutazione finora
Evaluation of Result Merging Strategies PDF
Documento14 pagine
Evaluation of Result Merging Strategies PDF
Vijay Karthi
Nessuna valutazione finora
Measure Term Similarity Using A Semantic Network Approach
Documento5 pagine
Measure Term Similarity Using A Semantic Network Approach
BOHR International Journal of Computer Science (BIJCS)
Nessuna valutazione finora
A Review On Query Clustering Algorithms For Search Engine Optimization
Documento5 pagine
A Review On Query Clustering Algorithms For Search Engine Optimization
editor_ijarcsse
Nessuna valutazione finora
2 14 1625295578 2ijcseitrdec20212
Documento10 pagine
2 14 1625295578 2ijcseitrdec20212
TJPRC Publications
Nessuna valutazione finora
Search Engine Personalization Tool Using Linear Vector Algorithm
Documento9 pagine
Search Engine Personalization Tool Using Linear Vector Algorithm
ahmed_trab
Nessuna valutazione finora
A Web Search Engine-Based Approach To Measure Semantic Similarity Between Words
Documento14 pagine
A Web Search Engine-Based Approach To Measure Semantic Similarity Between Words
Haritha Chowdary
Nessuna valutazione finora
U10a1 DATA ANALYTICS INTERNSHIP TEXT MINING APPLICATIONS Hal Hagood Dante Durrman
Documento8 pagine
U10a1 DATA ANALYTICS INTERNSHIP TEXT MINING APPLICATIONS Hal Hagood Dante Durrman
HalHagood
Nessuna valutazione finora
Text Classification MLND Project Report Prasann Pandya
Documento17 pagine
Text Classification MLND Project Report Prasann Pandya
Raja Purba
Nessuna valutazione finora
Information Retrieval System
Documento4 pagine
Information Retrieval System
Sonu Davidson
Nessuna valutazione finora
Abstract
Documento4 pagine
Abstract
ahyan.ingenium
Nessuna valutazione finora
Measuring Semantic Similarity Between Words Using Web Search Engines
Documento10 pagine
Measuring Semantic Similarity Between Words Using Web Search Engines
saman
Nessuna valutazione finora
Web Search and Geographic Location: Mikew@sims - Berkeley.edu
Documento7 pagine
Web Search and Geographic Location: Mikew@sims - Berkeley.edu
Jen
Nessuna valutazione finora
Automatic Image Annotation: Fundamentals and Applications
Da Everand
Automatic Image Annotation: Fundamentals and Applications
Fouad Sabry
Nessuna valutazione finora
Ijaiem 2014 05 31 127
Documento7 pagine
Ijaiem 2014 05 31 127
International Journal of Application or Innovation in Engineering & Management
Nessuna valutazione finora
Ranking and Searching of Document With New Innovative Method in Text Mining: First Review
Documento7 pagine
Ranking and Searching of Document With New Innovative Method in Text Mining: First Review
International Journal of Application or Innovation in Engineering & Management
Nessuna valutazione finora
Research Paper On Information Retrieval System
Documento7 pagine
Research Paper On Information Retrieval System
fys1q18y
100% (1)
Concurrent Context Free Framework For Conceptual Similarity Problem Using Reverse Dictionary
Documento4 pagine
Concurrent Context Free Framework For Conceptual Similarity Problem Using Reverse Dictionary
Editor IJRITCC
Nessuna valutazione finora
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
Documento4 pagine
IJCER (WWW - Ijceronline.com) International Journal of Computational Engineering Research
International Journal of computational Engineering research (IJCER)
Nessuna valutazione finora
Hybrid Search: Effectively Combining Keywords and Semantic Searches
Documento15 pagine
Hybrid Search: Effectively Combining Keywords and Semantic Searches
Siti Hajar
Nessuna valutazione finora
PDC Review2
Documento23 pagine
PDC Review2
corote1026
Nessuna valutazione finora
Ontology-Based Interpretation of Keywords For Semantic Search
Documento14 pagine
Ontology-Based Interpretation of Keywords For Semantic Search
annanettar
Nessuna valutazione finora
Explain Item Normalization?
Documento7 pagine
Explain Item Normalization?
Shushanth munna
Nessuna valutazione finora
The Research of Method For Blurring Image Reconstruction: Intelligent
Documento6 pagine
The Research of Method For Blurring Image Reconstruction: Intelligent
Varun Kalpurath
Nessuna valutazione finora
swj248 PDF
Documento8 pagine
swj248 PDF
akttripathi
Nessuna valutazione finora
Information Retrieval
Documento5 pagine
Information Retrieval
NB
Nessuna valutazione finora
Literature Review Computer Science Projects
Documento4 pagine
Literature Review Computer Science Projects
c5jxjm5m
100% (1)
Example of Keywords in Research Paper
Documento4 pagine
Example of Keywords in Research Paper
jhnljzbnd
100% (1)
Web Query Mining
Documento16 pagine
Web Query Mining
Muhammad Miftakul Amin
Nessuna valutazione finora
The Technique of Different Semantic Search Engines: Upasana Sinha, Vikas Dubey
Documento6 pagine
The Technique of Different Semantic Search Engines: Upasana Sinha, Vikas Dubey
DavidTumini Ogolo
Nessuna valutazione finora
CLASS10 Fundamentals of Data Structures
Documento6 pagine
CLASS10 Fundamentals of Data Structures
police station
Nessuna valutazione finora
(IJCST-V3I3P47) : Sarita Yadav, Jaswinder Singh
Documento5 pagine
(IJCST-V3I3P47) : Sarita Yadav, Jaswinder Singh
EighthSenseGroup
Nessuna valutazione finora
Discovery of Similarity Computations of Search Engines: King-Lup Liu Weiyi Meng Clement Yu
Documento8 pagine
Discovery of Similarity Computations of Search Engines: King-Lup Liu Weiyi Meng Clement Yu
postscript
Nessuna valutazione finora
5s PDF
Documento9 pagine
5s PDF
Praveen Kumar
Nessuna valutazione finora
A New Survey On Upgrade Query Testimonial Technique Supporting Exploratory Search Using Search Goal Shift Graph
Documento3 pagine
A New Survey On Upgrade Query Testimonial Technique Supporting Exploratory Search Using Search Goal Shift Graph
mukesh poundekar
Nessuna valutazione finora
Irs Unit5
Documento6 pagine
Irs Unit5
vinaynotbinay
Nessuna valutazione finora
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
Documento10 pagine
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
avi
Nessuna valutazione finora
Binary Search Research Paper
Documento5 pagine
Binary Search Research Paper
fys5ehgs
100% (1)
Tzitzikas 2018 ESWC EMSASW
Documento10 pagine
Tzitzikas 2018 ESWC EMSASW
Panagiotis Papadakos
Nessuna valutazione finora
Smriti Mishra
Documento15 pagine
Smriti Mishra
Docukits
Nessuna valutazione finora
MASWS Assignment 3 A Review On Picky, A SW Search Engine: 1 Doing Search Over Semantically Organized Text?
Documento5 pagine
MASWS Assignment 3 A Review On Picky, A SW Search Engine: 1 Doing Search Over Semantically Organized Text?
Qijun Liu
Nessuna valutazione finora
A Language For Manipulating Clustered Web Documents Results
Documento19 pagine
A Language For Manipulating Clustered Web Documents Results
Alessandro Siro Campi
Nessuna valutazione finora
Efficiently Searching Nearest Neighbor in Documents
Documento3 pagine
Efficiently Searching Nearest Neighbor in Documents
International Journal of Research in Engineering and Technology
Nessuna valutazione finora
A Survey On Personalized Meta Search Engine
Documento4 pagine
A Survey On Personalized Meta Search Engine
editor_ijarcsse
Nessuna valutazione finora
Everything in Brief Introduction
Documento5 pagine
Everything in Brief Introduction
02.satya.2001
Nessuna valutazione finora
Legal Text Mining
Documento7 pagine
Legal Text Mining
IJRASETPublications
Nessuna valutazione finora
Guidelines For Performing Systematic Literature Reviews in Software Engineering 2007
Documento6 pagine
Guidelines For Performing Systematic Literature Reviews in Software Engineering 2007
afmzhpeloejtzj
Nessuna valutazione finora
Online Handwritten Cursive Word Recognition
Documento40 pagine
Online Handwritten Cursive Word Recognition
kousalya
Nessuna valutazione finora
V3i608 PDF
Documento7 pagine
V3i608 PDF
IJCERT PUBLICATIONS
Nessuna valutazione finora
Dynamic Ranking Algorithm Using Multi Graph Technology
Documento9 pagine
Dynamic Ranking Algorithm Using Multi Graph Technology
Rachel Wheeler
Nessuna valutazione finora
MTH601 2 Sol
Documento2 pagine
MTH601 2 Sol
Asad Amanat
Nessuna valutazione finora
Sta301 GDB Idea Solution
Documento1 pagina
Sta301 GDB Idea Solution
Asad Amanat
Nessuna valutazione finora
The Educators Nuttertools
Documento1 pagina
The Educators Nuttertools
Asad Amanat
Nessuna valutazione finora
Cs201 2 Quiz NimRa
Documento10 pagine
Cs201 2 Quiz NimRa
Asad Amanat
Nessuna valutazione finora
Assignment No. 02: SEMESTER Fall 2010 CS301-Data Structure Kiran Zahra
Documento5 pagine
Assignment No. 02: SEMESTER Fall 2010 CS301-Data Structure Kiran Zahra
Asad Amanat
Nessuna valutazione finora
Mgt411 Idea Solution
Documento1 pagina
Mgt411 Idea Solution
Asad Amanat
Nessuna valutazione finora
8.1.4.7 Packet Tracer - Subnetting Scenario 2
Documento5 pagine
8.1.4.7 Packet Tracer - Subnetting Scenario 2
Ajay
0% (1)
Huawei WLAN Roaming Feature Presentation
Documento9 pagine
Huawei WLAN Roaming Feature Presentation
ismoil
Nessuna valutazione finora
Cables (For FIO)
Documento21 pagine
Cables (For FIO)
muneeb.irfan9873
Nessuna valutazione finora
Resultados de La Web: VGH & UBC Hospital Foundation: Home
Documento9 pagine
Resultados de La Web: VGH & UBC Hospital Foundation: Home
Jo Paternina
Nessuna valutazione finora
CCNA 1 Chapter 10 v5.0 Exam Answers 2015 100
Documento6 pagine
CCNA 1 Chapter 10 v5.0 Exam Answers 2015 100
ovidiu0702
Nessuna valutazione finora
Recorder Duets
Documento11 pagine
Recorder Duets
Pedro Daniel Macalupú Cumpén
92% (13)
Aqui Face
Documento12 pagine
Aqui Face
luantreta619
Nessuna valutazione finora
Consent To Electronic Communications
Documento2 pagine
Consent To Electronic Communications
Vilmarie Rivera
Nessuna valutazione finora
CN Unit-I
Documento167 pagine
CN Unit-I
18W91A0C0
Nessuna valutazione finora
Kim Kardashian Bashes Kanye Amid Divorce Drama
Documento3 pagine
Kim Kardashian Bashes Kanye Amid Divorce Drama
Andrewson Bautista
Nessuna valutazione finora
VITA - EN - PMMG Update 2.2015 V.2.0 20150715 BF - Screen - en
Documento14 pagine
VITA - EN - PMMG Update 2.2015 V.2.0 20150715 BF - Screen - en
Izat Izat
Nessuna valutazione finora
(PDF) Er Prinzipito en Andalú. (Primerah Z'ohah Der Libro + Komentarioh) Huan Porrah Blanko - Academia - Edu
Documento6 pagine
(PDF) Er Prinzipito en Andalú. (Primerah Z'ohah Der Libro + Komentarioh) Huan Porrah Blanko - Academia - Edu
Esther Miguel Trula
Nessuna valutazione finora
VB5 Cracking With SmartCheck 5.0
Documento5 pagine
VB5 Cracking With SmartCheck 5.0
Sanjay Patil
Nessuna valutazione finora
CIT Notes: Prof. Rakhi Tripathi & Prof. Rajneesh Chauhan
Documento46 pagine
CIT Notes: Prof. Rakhi Tripathi & Prof. Rajneesh Chauhan
Saket Nandan
Nessuna valutazione finora
ForexRealProfitEA v5.11 Manual 12.21.2010
Documento29 pagine
ForexRealProfitEA v5.11 Manual 12.21.2010
RODRIGO TROCONIS
100% (1)
Infoblox Installation Guide 805 Series Appliances
Documento20 pagine
Infoblox Installation Guide 805 Series Appliances
rinsonjohnp
Nessuna valutazione finora
Lesson Plan Ethical and Social Responsibility - 0
Documento8 pagine
Lesson Plan Ethical and Social Responsibility - 0
nicodemus balasuela
Nessuna valutazione finora
Cyber Safety
Documento42 pagine
Cyber Safety
Dhruval
Nessuna valutazione finora
Prepare Level 3 Achievement Test 3 U9-12
Documento2 pagine
Prepare Level 3 Achievement Test 3 U9-12
Maryna Mykhailova
Nessuna valutazione finora
An Tororo
Documento2 pagine
An Tororo
khauka ronald
Nessuna valutazione finora
OpenText - OEM Sales Training - Product Descriptions, Deep Dives, & Bundles
Documento179 pagine
OpenText - OEM Sales Training - Product Descriptions, Deep Dives, & Bundles
Mahendra Reddy
Nessuna valutazione finora
Cisco SPA 502G 1-Line IP Phone
Documento5 pagine
Cisco SPA 502G 1-Line IP Phone
mastr
Nessuna valutazione finora
Why Redawning Is The Smart Choice: One Stop. A Complete Suite of Marketing Solutions
Documento2 pagine
Why Redawning Is The Smart Choice: One Stop. A Complete Suite of Marketing Solutions
Mandie Holmes
Nessuna valutazione finora
Connected
Documento305 pagine
Connected
hoho
Nessuna valutazione finora
400 Series User Manual
Documento40 pagine
400 Series User Manual
tmtt44
Nessuna valutazione finora
How2r1 Datasheet
Documento3 pagine
How2r1 Datasheet
Mike Perez
Nessuna valutazione finora
Configuring OAM 11g Server in CERT Mode
Documento3 pagine
Configuring OAM 11g Server in CERT Mode
Denem Orhun
Nessuna valutazione finora
Del Carmen National High School - Caub Extension: February 11, 2020
Documento3 pagine
Del Carmen National High School - Caub Extension: February 11, 2020
Charles Kenn Mantilla
100% (1)
30 Questions To Ask College Admissions
Documento2 pagine
30 Questions To Ask College Admissions
RasmussenCollege
Nessuna valutazione finora
Project Charter
Documento2 pagine
Project Charter
karan
Nessuna valutazione finora