Please use this identifier to cite or link to this item: http://www.dspace.espol.edu.ec/handle/123456789/7701
Full metadata record
DC Field | Value | Language
dc.contributor.author | Varas Palomeque, Irene Carolina | -
dc.contributor.author | Paladines Herrera, Gabriel Antonio | -
dc.contributor.author | Abad, Cristina | -
dc.date.accessioned | 2009-10-15 | -
dc.date.available | 2009-10-15 | -
dc.date.issued | 2009-10-15 | -
dc.identifier.uri | http://www.dspace.espol.edu.ec/handle/123456789/7701 | -
dc.description.abstract | In this project we built a regular-expression search engine over the Wikipedia article database. The system lets the user enter a regular expression and issues an asynchronous request that initializes an EC2 cluster; the cluster searches for the pattern across all of Wikipedia and returns the results as a list of every occurrence of the pattern, each with a link to the corresponding Wikipedia article. We used Amazon Web Services, Java libraries for manipulating Wikipedia articles, the Hadoop framework, and a dataset of Wikipedia articles. We tested several regular expressions that could not be searched for with either traditional search engines or Wikipedia's own search engine. Our tests show that an advanced search engine can be cheap to implement while providing high scalability through cloud computing and data-intensive computing techniques. | en
dc.language.iso | spa | en
dc.rights | openAccess | -
dc.subject | HADOOP | en
dc.subject | CLOUD COMPUTING | en
dc.subject | MAPREDUCE | en
dc.subject | ELASTIC MAPREDUCE | en
dc.subject | SIMPLE STORAGE SERVICE S3 | en
dc.subject | WIKIPEDIA | en
dc.subject | DATASET | en
dc.subject | EC2 CLUSTER | en
dc.title | Wikigrep distribuido: búsquedas avanzadas en la wikipedia | en
dc.type | Article | en
Appears in Collections: Artículos de Tesis de Grado - FIEC
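
Illustrative note: the abstract describes a distributed grep in which a regular expression is matched against every Wikipedia article and each occurrence is returned with a link to its article. The record does not include the authors' code, so the following is only a minimal single-process sketch of that map/reduce idea in Python. The function names, the sample articles, and the English-language Wikipedia URL prefix are all assumptions made for illustration; the actual system ran on Hadoop over an EC2 cluster and may differ substantially.

```python
import re
from collections import defaultdict
from urllib.parse import quote


def map_article(title, text, pattern):
    """Map step: emit one (title, matched_text) pair per occurrence."""
    for match in pattern.finditer(text):
        yield title, match.group(0)


def reduce_matches(pairs):
    """Reduce step: group the matched strings by article title."""
    grouped = defaultdict(list)
    for title, matched in pairs:
        grouped[title].append(matched)
    return dict(grouped)


def article_url(title):
    # Assumption: links point at the English-language Wikipedia.
    return "https://en.wikipedia.org/wiki/" + quote(title.replace(" ", "_"))


def wikigrep(articles, regex):
    """Run the map and reduce steps in-process over (title, text) pairs."""
    pattern = re.compile(regex)
    pairs = (
        pair
        for title, text in articles
        for pair in map_article(title, text, pattern)
    )
    return {article_url(t): hits for t, hits in reduce_matches(pairs).items()}


# Hypothetical sample data standing in for the Wikipedia dataset.
SAMPLE_ARTICLES = [
    ("Apache Hadoop", "Hadoop 0.18 and Hadoop 0.20 were early releases."),
    ("Amazon S3", "Amazon S3 stores objects in buckets."),
]

if __name__ == "__main__":
    for url, hits in wikigrep(SAMPLE_ARTICLES, r"Hadoop \d+\.\d+").items():
        print(url, hits)
```

In a real MapReduce job the map step would run in parallel over splits of the article dump and the framework would perform the grouping by key; here both steps run sequentially only to show the data flow.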



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.