Available Challenges and Guidelines in the Field of Deep Web and Intensive Crawling

Ezatdoost, Yasin; Tourani, Ali; Danesh, Amir Seyed
September 2013
International Journal of Computer Applications;Sep2013, Vol. 77, p1
Academic Journal
Today, a great deal of information is available on the Web, and the only way to access it is through search interfaces. A web crawler is an automated script that browses the web independently: it starts its task with a "seed URL" and then follows the links available on each page. This confronts many existing crawlers with fundamental difficulties. Identifying the search interface and selecting a proper query, on the one hand, and retrieving the documents returned by the web as results, on the other, are issues that intensify the challenges facing web crawlers. The aim of the present paper is to investigate the available challenges and guidelines in the field of deep web and intensive crawling.
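
The crawling loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `crawl` function, the `fetch` callable, the `LinkExtractor` class and the `example.test` pages are all hypothetical names introduced here, and the "web" is a small in-memory dictionary so the example runs without network access. It starts from a seed URL, follows the `<a href>` links on each page breadth-first, and shows that a page reachable only by submitting a search form is never discovered.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first crawl from seed_url; returns URLs in visit order.

    fetch is an assumed callable mapping a URL to its HTML text,
    or to None when the page is unavailable.
    """
    visited = []
    seen = {seed_url}
    frontier = deque([seed_url])
    while frontier and len(visited) < max_pages:
        url = frontier.popleft()
        html = fetch(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)  # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                frontier.append(absolute)
    return visited


# Toy in-memory "web" with made-up URLs. The last page is only
# reachable by submitting the search form, not by following a link.
PAGES = {
    "http://example.test/": '<a href="/a">A</a> <a href="/b">B</a>',
    "http://example.test/a": '<a href="/b">B</a>',
    "http://example.test/b": '<form action="/search"><input name="q"></form>',
    "http://example.test/search?q=deep": "<p>result behind the form</p>",
}

order = crawl("http://example.test/", PAGES.get)
print(order)
```

In this toy setup the crawler visits the seed page and the two linked pages, but never the search result page, because reaching it would require recognising the form and choosing a query, which is the difficulty the abstract points to.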


Related Articles

  • Multiple perspective interactive search: a paradigm for exploratory search and information retrieval on the web. Singh, Rahul; Hsu, Ya-Wen; Moon, Naureen // Multimedia Tools & Applications;Jan2013, Vol. 62 Issue 2, p507 

    The World Wide Web (WWW) represents the largest and arguably the most complex repository of content at our current state of technological development. Information on the web is represented using a variety of media, with a (current) predominance of text- and images-based data and increasing...

  • WPRiMA Tool: Managing Risks in Web Projects. Al-Rousan, Thamer; Sulaiman, Shahida; Salam, Rosalina Abdul // International Journal of Human & Social Sciences;Oct2010, Vol. 5 Issue 11-14, p686 

    Risk management is an essential fraction of project management, which plays a significant role in project success. Many failures associated with Web projects are the consequences of poor awareness of the risks involved and lack of process models that can serve as a guideline for the development...

  • An Evaluation of Performance Through the Structure Framework of Interface. Jhih - Syong Pan; Chiuhsiang Joe Lin; Wei-Jung Shiang // World Academy of Science, Engineering & Technology;2011, Issue 58, p208 

    Since the establishment of web interface applications in 1992, it has been promoted worldwide at an astonishing speed; until now, web interface is the most common World Wide Web network. People rely on computer applications to learn, work and interact, make the usability of web interface more...

  • What Does the WWW Offer Mathematics Students and Teachers. KNOTT, RON // Teaching Mathematics & its Applications;1999, Vol. 18 Issue 1, p2 

    A brief historical sketch of the World Wide Web (WWW) explains why browsers are needed and what they can do for mathematics teachers and students. We compare information available on the Web for mathematics with an encyclopaedia and with books, say why the Web offers something vital and...

  • Les mashups : une illustration de l'agilité en marketing. Mercanti-Guérin, Maria // Decisions Marketing;jui-sep2013, Issue 71, p125 

    Mashup is a Web application primarily known for its use in cartographic data. However, mashups are becoming widespread in e-commerce Websites and provide services that go well beyond mapping. The objective of this research is to understand and measure the implications of these new applications...

  • A Practical T-P³ R² Model to Test Dynamic Websites. Chopra, Rajiv; Madan, Sushila // Journal of Information Engineering & Applications;2012, Vol. 2 Issue 6, p44 

    Present day web applications are very complex as they employ more objects (controls) on a web page than traditional web applications. This results in more memory leaks, more CPU utilizations and longer test executions. Furthermore, today websites are dynamic meaning that the web pages are loaded...

  • A NEW MODEL FOR E-BUSINESS PERFORMANCE TESTING. BABU, P. CHITTI; BHARATHI, K. C. K.; MOHAMED, J. SHAIK // Journal on Software Engineering;Jul-Sep2013, Vol. 8 Issue 1, p35 

    The extraordinary growth in the World Wide Web has been sweeping through business and industry. By using web technologies many companies have developed or integrated their critical applications. Testing web applications become crucial, particularly as performance of web applications become...

  • CCAMS: A Tool for Co-Curricular Activities Management. Farhan, Muhammad; Malik, Kaleem Razzaq; Farooq, Amjad // Journal of Computing;Aug2011, Vol. 3 Issue 8, p39 

    Learning management systems are widely used and also available in open source repositories for the curricular activities management. In this paper we will explore some learning management systems to check the support of co-curricular activities management. Students learn with the co-curricular...

  • XWADF: Architectural Pattern for Improving Performance of Web Applications. M. d. Umar Khan; Rao, T. V. // International Journal of Computer Science Issues (IJCSI);Mar2014, Vol. 11 Issue 2, p105 

    Ever since the advent of World Wide Web (WWW) web sites and their usage has become part of day-to-day life. Enterprises reach global audience through web applications. People of all walks of life need to use web applications in one way or other. Performance of web applications plays a key role in...

