Exploring the Relevance of Search Engines: An Overview of Google as a Case Study

Authors

  • Ricardo Beltrán Alfonso Universidad Distrital Francisco José de Caldas image/svg+xml
  • Andres Torres Tautiva Universidad Distrital Francisco José de Caldas image/svg+xml
  • Paulo Alonso Gaona García Universidad Distrital Francisco José de Caldas image/svg+xml
  • Carlos Enrique Montenegro Marín Universidad Distrital Francisco José de Caldas image/svg+xml

DOI:

https://doi.org/10.9781/ijimai.2017.449

Keywords:

Algorithms, Google, Search
Supporting Agencies
The authors gratefully acknowledged to Universidad Distrital Francisco José de Caldas trough GIIRA Research group.

Abstract

The huge amount of data on the Internet and the diverse list of strategies used to try to link this information with relevant searches through Linked Data have generated a revolution in data treatment and its representation. Nevertheless, the conventional search engines like Google are kept as strategies with good reception to do search processes. The following article presents a study of the development and evolution of search engines, more specifically, to analyze the relevance of findings based on the number of results displayed in paging systems with Google as a case study. Finally, it is intended to contribute to indexing criteria in search results, based on an approach to Semantic Web as a stage in the evolution of the Web.

Downloads

Download data is not yet available.

References

Askitas, N., & Zimmermann, K. F. (2015). The internet as a data source for advancement in social sciences. International Journal of Manpower, 36(1), 2–12. https://doi.org/10.1108/IJM-02-2015-0029

Stumme, G., Hotho, A., & Berendt, B. (2006). Semantic Web Mining: State of the art and future directions. Web Semantics, 4(2), 124–143. https://doi.org/10.1016/j.websem.2006.02.001

Zhao, J., Lu, X., Wang, X., & Ma, Z. (2015). Web information credibility: From Web 1.0 to Web 2.0. International Journal of u- and e-Service, Science and Technology, 8, 161–172. https://doi.org/10.14257/ijunesst.2015.8.7.16

Fritch, J. W. (2003). Heuristics, tools, and systems for evaluating Internet information: Helping users assess a tangled Web. Online Information Review, 27, 321–327. https://doi.org/10.1108/14684520310502270

Frazier, B. (2013). Niche search engines: Expanding information discovery. The Reference Librarian, 54, 168–174. https://doi.org/10.1080/02763877.2013.755440

Suárez-Barón, M., & Salinas-Valencia, K. (2009). An approach to semantic indexing and information retrieval. Revista Facultad de Ingeniería Universidad de Antioquia, (48), 174–187.

Bizer, C., Heath, T., & Berners-Lee, T. (2009). Linked data—The story so far. In Semantic Services, Interoperability and Web Applications: Emerging Concepts (pp. 205–227). https://doi.org/10.4018/jswis.2009081901

Andrienko, N., & Andrienko, G. (2013). Visual analytics of movement: An overview of methods, tools and procedures. Information Visualization, 12(1), 3–24. https://doi.org/10.1177/1473871612457601

Keim, D., Andrienko, G., Fekete, J.-D., Görg, C., Kohlhammer, J., & Melançon, G. (2008). Visual analytics: Definition, process, and challenges. Lecture Notes in Computer Science, 4950, 154–175. https://doi.org/10.1007/978-3-540-70956-5_7

Kohlhammer, J., Keim, D., Pohl, M., Santucci, G., & Andrienko, G. (2011). Solving problems with visual analytics. Procedia Computer Science, 7, 117–120. https://doi.org/10.1016/j.procs.2011.12.035

Gaona-García, P. A., Martín-Moncunill, D., Sánchez-Alonso, S., & Fermoso, A. (2014). A usability study of taxonomy visualisation user interfaces in digital repositories. Online Information Review, 38(2), 284–304. https://doi.org/10.1108/OIR-03-2013-0051

Gaona-García, P. A., Sánchez-Alonso, S., & Montenegro Marín, C. E. (2014). Visualization of information: A proposal to improve the search and access to digital resources in repositories. Ingeniería e Investigación, 34, 83–89. https://doi.org/10.15446/ing.investig.v34n1.39449

Gaona-García, P. A., Stoitsis, G., Sánchez-Alonso, S., & Biniari, K. (2016). An exploratory study of user perception in visual search interfaces based on SKOS. Knowledge Organization, 43(4).

Edosomwan, J., & Edosomwan, T. O. (2010). Comparative analysis of some search engines. South African Journal of Science, 106(11–12). https://doi.org/10.4102/sajs.v106i11/12.169

Tümer, D., Shah, M. A., & Bitirim, Y. (2009). An empirical evaluation on semantic search performance of keyword-based and semantic search engines: Google, Yahoo, MSN and Hakia. In Proceedings of the 4th International Conference on Internet Monitoring and Protection (ICIMP 2009) (pp. 51–55). https://doi.org/10.1109/ICIMP.2009.16

Gupta, P., Singh, S. K., Yadav, D., & Sharma, A. K. (2013). An improved approach to ranking web documents. Journal of Information Processing Systems, 9(2), 217–236. https://doi.org/10.3745/JIPS.2013.9.2.217

Haveliwala, T. H. (1999). Efficient computation of PageRank (Technical Report). Stanford University.

Haveliwala, T. H. (2003). Topic-sensitive PageRank: A context-sensitive ranking algorithm for Web search. IEEE Transactions on Knowledge and Data Engineering, 15(4), 784–796. https://doi.org/10.1109/TKDE.2003.1208999

Kliman-Silver, C., Hannak, A., Lazer, D., Wilson, C., & Mislove, A. (2015). Location, location, location: The impact of geolocation on web search personalization. In Proceedings of the ACM SIGCOMM Internet Measurement Conference (IMC 2015) (pp. 121–127). https://doi.org/10.1145/2815675.2815714

Zhang, W., & Chen, Y. (2014). Bayes topic prediction model for focused crawling of vertical search engine. In 2014 IEEE Computers, Communications and IT Applications Conference (ComComAp 2014) (pp. 294–299). https://doi.org/10.1109/ComComAp.2014.7017213

Zhang, L., Song, H., Yu, S., & Ma, F. (2004). Design and implementation of a high-performance distributed web crawler. Journal of Shanghai Jiaotong University, 38, 59–61.

Liu, L., & Peng, T. (2013). Post-processing of deep web information extraction based on domain ontology. Advances in Electrical and Computer Engineering, 13(4), 25–32. https://doi.org/10.4316/AECE.2013.04005

Brin, S. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks, 30(1–7), 107–117.

Kausar, M. A., Dhaka, V. S., & Singh, S. K. (2013). Web crawler: A review. International Journal of Computer Applications, 63(2). https://doi.org/10.5120/10440-5125

Martínez-Rodríguez, A. (2006). Cybermetric indicators: New proposals to measure information in the digital environment. ACIMED, 14(4).

Ahmadi-Abkenari, F., & Selamat, A. (2012). An architecture for a focused trend parallel Web crawler with the application of clickstream analysis. Information Sciences, 184(1), 266–281. https://doi.org/10.1016/j.ins.2011.08.022

Raval, V., & Kumar, P. (2012). SEReleC (Search Engine Result Refinement and Classification): A meta search engine based on combinatorial search and search keyword based link classification. In IEEE International Conference on Advances in Engineering, Science and Management (ICAESM 2012) (pp. 627–631).

Kim, K. S., Kim, K. Y., Lee, K. H., Kim, T. K., & Cho, W. S. (2012). Design and implementation of web crawler based on dynamic web collection cycle. In International Conference on Information Networking (pp. 562–566). https://doi.org/10.1109/ICOIN.2012.6164440

Brin, S., & Page, L. (2012). Reprint of: The anatomy of a large-scale hypertextual web search engine. Computer Networks, 56(18), 3825–3833. https://doi.org/10.1016/j.comnet.2012.10.007

Dikaiakos, M. D., Stassopoulou, A., & Papageorgiou, L. (2005). An investigation of web crawler behavior: Characterization and metrics. Computer Communications, 28(8), 880–897. https://doi.org/10.1016/j.comcom.2005.01.003

Whiting, M. A., & Cramer, N. (2002). WebThemeTM: Understanding web information through visual analytics. Lecture Notes in Computer Science (Vol. 2342, pp. 460–468). https://doi.org/10.1007/3-540-48005-6_41

Boldi, P., Codenotti, B., Santini, M., & Vigna, S. (2004). UbiCrawler: A scalable fully distributed web crawler. Software—Practice & Experience, 34(8), 711–726. https://doi.org/10.1002/spe.587

Singhal, N., Dixit, A., Agarwal, R. P., & Sharma, A. K. (2012). Regulating frequency of a migrating web crawler based on users interest. International Journal of Engineering Technology, 4(4), 246–253.

Shettar, R., & Shobha, G. (2008). Web crawler on client machine. In Proceedings of the International MultiConference of Engineers and Computer Scientists (Vol. 2).

Nielsen, J. (1994). Usability inspection methods. In Proceedings of the Conference on Human Factors in Computing Systems (CHI ’94) (pp. 413–414). https://doi.org/10.1145/259963.260531

Maguire, M. (2001). Methods to support human-centred design. International Journal of Human–Computer Studies, 55(4), 587–634. https://doi.org/10.1006/ijhc.2001.0503

Marzal, M. A., Calzada-Prado, J., Ruiz, M. J. C., & Cerveró, A. C. (2006). Development of a controlled vocabulary for learning objects’ functional description in an educational repository. In Proceedings of the International Conference on Dublin Core and Metadata Applications.

Manjula, D., & Geetha, T. V. (2004). Semantic search engine. Journal of Information & Knowledge Management, 3(1), 107–117. https://doi.org/10.1142/S0219649204000729

Cobos, C., León, E., & Mendoza, M. (2010). A harmony search algorithm for clustering with feature selection. Revista Facultad de Ingeniería Universidad de Antioquia, (55), 153–164.

Serón, F. J., & Bobed, C. (2016). VOX system: A semantic embodied conversational agent exploiting linked data. Multimedia Tools and Applications, 75(1), 381–404. https://doi.org/10.1007/s11042-014-2295-5

Downloads

Published

2017-06-01
Metrics
Views/Downloads
  • Abstract
    100
  • PDF
    56

How to Cite

Beltrán Alfonso R., Torres Tautiva, A., Gaona García, P. A., and Montenegro Marín, C. E. (2017). Exploring the Relevance of Search Engines: An Overview of Google as a Case Study. International Journal of Interactive Multimedia and Artificial Intelligence, 4(4), 72–79. https://doi.org/10.9781/ijimai.2017.449

Most read articles by the same author(s)

1 2 > >>