Academia.edu no longer supports Internet Explorer.

To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to  upgrade your browser .

Enter the email address you signed up with and we'll email you a reset link.

  • We're Hiring!
  • Help Center

paper cover thumbnail

Web Mining: A Survey of Current Research, Techniques, and Software

Profile image of Richard Segall

2008, International Journal of Information Technology & Decision Making

The purpose of this paper is to provide a more current evaluation and update of web mining research and techniques available. Current advances in each of the three different types of web mining are reviewed in the categories of web content mining, web usage mining, and web structure mining. For each tabulated research work, we examine such key issues as web mining process, methods/techniques, applications, data sources, and software used. Unlike previous investigators, we divide web mining processes into the following five subtasks: (1) resource finding and retrieving, (2) information selection and preprocessing, (3) patterns analysis and recognition, (4) validation and interpretation, and (5) visualization. This paper also reports the comparisons and summaries of selected software for web mining. The web mining software selected for discussion and comparison in this paper are SPSS Clementine, Megaputer PolyAnalyst, ClickTracks by web analytics, and QL2 by QL2 Software Inc. Applicat...

Related Papers

anurag kumar

Web Mining is moving the World Wide Web towards a more useful environment in which users can quickly and easily find the information they need. Large amount of text documents, multimedia files and images are available in the web and it is still increasing. Data mining is the form of extracting data’s available in the internet. Web mining is a part of data mining. Web mining is used to discover and extract information from Web-related data sources such as Web documents, Web content, hyperlinks and server logs. The term Web mining has been used in three distinct ways. The first, called Web content mining is the process of information discovery from sources across the World Wide Web. The second, called Web structure mining is the process of analyzing the relationship between Web pages linked by information or direct link connection through the use of graph theory. The third, called Web usage mining is the process of extracting patterns and information from server logs to gain insight on user activity. In this paper, we are trying to give a brief idea regarding web mining concerned with its techniques, tools and applications.

web mining research papers 2021 pdf

Richard Segall

Venkata Ramana

brijesh singh

Bonfring International Journal

Web is a platforms for information exchange, as it is simple and easy to publish documents. Searching for information becomes a difficult and time-consuming process as the web grows. Web mining uses various data mining techniques to discover useful knowledge from usage log file from the web. The mining tools are used to scan the HTML documents, images, and text, the results is provided for the search engines.It can assist search engines in providing productive results of each search in order of their relevance. In this paper, we brief introduction to the concepts related to web mining and then an overview of different Web usage mining.

Dr. M.A.Dorairangaswamy

This study presents the role of Web mining an explosive growth of the World Wide Web; websites are providing an information and knowledge to the end users. This is the review paper which show deep and intense study of various technologies available for web mining and it is the application of data mining techniques to extract knowledge from web. Current advances in each of the three different types of web mining are reviewed in the categories of web content mining, web usage mining, and web structure mining. Index Terms—web mining, web content mining, web usage mining, web structure mining.

aarti Pandey

Jawad Mughal

Research Publish Journals

Abstract: Web mining is a very hot research topic which combines two of the activated research areas: Data Mining and World Wide Web. The Web mining research relates to several research communities such as Database, Information Retrieval and Artificial Intelligence. Although there exists quite some confusion about the Web mining, the most recognized approach is to categorize Web mining into three areas: Web content mining, Web structure mining, and Web usage mining. Web content mining focuses on the discovery/retrieval of the useful information from the Web contents/data/documents, while the Web structure mining emphasizes to the discovery of how to model the underlying link structures of the Web. The distinction between these two categories isn't a very clear sometimes. Web usage mining is relative independent, but not isolated, category, which mainly describes the techniques that discover the user's usage pattern and try to predict the user's behaviors. This paper is a survey based on the recently published research papers. Besides providing an overall view of Web mining, this paper will focus on Web usage mining. Generally speaking, Web usage mining consists of three phases: Pre-processing, Pattern discovery and Pattern analysis. A detailed description will be given for each part of them, however, special attention will be paid to the user navigation patterns discovery and analysis. The user privacy is another important issue in this paper. An example of a prototypical Web usage mining system, WebSIFT, will be introduced to make it easier to understand the methodology of how to apply data mining techniques to large Web data repositories in order to extract usage patterns. Finally, along with some other interested research issues, a brief overview of the current research work in the area of Web usage mining is included. Title: WEB MINING AN APPLICATION OF DATA MINING Author: Sumit Dalal, Sumit Kumar, Vivek Dixit International Journal of Computer Science and Information Technology Research ISSN 2348-120X (online), ISSN 2348-1196 (print) Research Publish Journals

maithreyan surya

The World Wide Web is a popular and interactive medium to disseminate information today. It is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images, videos, and other multimedia, and navigate between them via hyperlinks. With the recent explosive growth of the amount of content on the Internet, it has become increasingly difficult for users to find and utilize information and for content providers to classify and catalog documents on the World Wide Web. Traditional web search engines often return hundreds or thousands of results for a search, which is time consuming for users to browse. On-line libraries, search engines, and other large document repositories (e.g. customer support databases, product specification databases, press release archives, news story archives, etc.) are growing so rapidly that it is difficult and costly to categorize every document manually. To deal with these problems web mining is used. Web mining is the use of data mining techniques to automatically discover and extract information from the web documents and services. This paper presents an overview of web mining, its methodologies, algorithms and applications.

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

RELATED PAPERS

International Journal on Advanced Science, Engineering and Information Technology

azizul azhar ramli (PhD - Soft Computing)

International Journal of Engineering Sciences & Research Technology

Ijesrt Journal

IJESRT Journal

International Journal of Engineering Research and Technology (IJERT)

IJERT Journal

Journal of Computer Science IJCSIS

Srinaganya Gopalrathnam

Editor IJRET

nasrin jokar

Ijetrm Journal

SenthilKumar N

Science and Education Publishing

Shyam Nandan Kumar

Archit Joshi

ACM SIGKDD Explorations Newsletter

International Journal of Computer Science and Informatics

Nagaratna Hegde

balaji narayanaswami

international journal of engineering trends and technology

nitin chopde

WSEAS Transactions on Information Science and …

Jose Aguilar

christy eunaicy

Myra Spiliopoulou

International Journal IJRITCC

International Journal of Computer Applications

Dr Pawan Singh

RELATED TOPICS

  •   We're Hiring!
  •   Help Center
  • Find new research papers in:
  • Health Sciences
  • Earth Sciences
  • Cognitive Science
  • Mathematics
  • Computer Science
  • Academia ©2024

Text and Web Content Mining: A Systematic Review

  • Conference paper
  • First Online: 08 August 2021
  • Cite this conference paper

web mining research papers 2021 pdf

  • Fatima Almatrooshi   ORCID: orcid.org/0000-0001-9446-273X 13 ,
  • Sumayya Alhammadi   ORCID: orcid.org/0000-0002-0586-1403 13 ,
  • Said A. Salloum   ORCID: orcid.org/0000-0002-6073-3981 14 , 15 &
  • Khaled Shaalan   ORCID: orcid.org/0000-0003-0823-8390 13  

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 299))

Included in the following conference series:

  • International Conference on Emerging Technologies and Intelligent Systems

1139 Accesses

Text and content mining are the subcategory of data mining. This category of data mining is used to extract the information from web or web pages of a website. This mining identifies useful information from web content like web pages, search logs, and other website-related content. The extracted information can be used in many applications, for example, we can extract opinions from online sources and web hierarchy which provides better insights and knowledge. In this paper, we conducted a systematic review that included 18 research papers that are relevant to the topic and matches the inclusion criteria of this study. From these research papers we were able to answer the research questions that we identified. The questions are related to the applications, techniques and issues of the text and web mining. The findings suggest that many research papers made a good foundation for this topic, and gave an informative explanation of each type of techniques used in text and web mining, as well as some issues that can be a future work for researchers who are interested in the topic.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

web mining research papers 2021 pdf

A Review: Web Content Mining Techniques

web mining research papers 2021 pdf

In Search of Insight from Unstructured Text Data: Towards an Identification of Text Mining Techniques

web mining research papers 2021 pdf

Web Structure Mining Algorithms: A Survey

Özyirmidokuz, E.K., Özyirmidokuz, M.H.: Analyzing customer complaints: a web text mining application. In: Proceedings of INTCESS14-International Conference on Education and Social Sciences, Istanbul, pp. 734–743 (2014)

Google Scholar  

What is Text Mining: Techniques and Applications (2020)

Sharma, A.K., Gupta, P.: Study and analysis of web content mining tools to improve techniques of web data mining. Int. J. Adv. Res. 1 (8), 287–293 (2012)

Hamid Mughal, M.J.: Data mining: web data mining techniques, tools and algorithms: an overview. Int. J. Adv. Comput. Sci. Appl. 9 (6), 208–215 (2018)

Milic, P.: Framework for Web Content Mining (2017)

Download references

Acknowledgment

This work is a part of a project submitted in fulfilment of PhD Computer Science - Faculty of Engineering & Information Technology at The British University in Dubai.

Author information

Authors and affiliations.

Faculty of Engineering and IT, The British University in Dubai, Dubai, UAE

Fatima Almatrooshi, Sumayya Alhammadi & Khaled Shaalan

School of Science, Engineering, and Environment, University of Salford, Salford, UK

Said A. Salloum

Machine Learning and NLP Research Group, Department of Computer Science, University of Sharjah, Sharjah, UAE

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Said A. Salloum .

Editor information

Editors and affiliations.

The British University in Dubai, Dubai, United Arab Emirates

Mostafa Al-Emran

Universiti Teknologi Malaysia, Skudai, Johor, Malaysia

Mohammed A. Al-Sharafi

Al Buraimi University College, Al Buraimi, Oman

Mohammed N. Al-Kabi

Khaled Shaalan

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper.

Almatrooshi, F., Alhammadi, S., Salloum, S.A., Shaalan, K. (2022). Text and Web Content Mining: A Systematic Review. In: Al-Emran, M., Al-Sharafi, M.A., Al-Kabi, M.N., Shaalan, K. (eds) Proceedings of International Conference on Emerging Technologies and Intelligent Systems. ICETIS 2021. Lecture Notes in Networks and Systems, vol 299. Springer, Cham. https://doi.org/10.1007/978-3-030-82616-1_8

Download citation

DOI : https://doi.org/10.1007/978-3-030-82616-1_8

Published : 08 August 2021

Publisher Name : Springer, Cham

Print ISBN : 978-3-030-82615-4

Online ISBN : 978-3-030-82616-1

eBook Packages : Intelligent Technologies and Robotics Intelligent Technologies and Robotics (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

The Science and Information (SAI) Organization

Publication Links

  • Author Guidelines
  • Publication Policies
  • Metadata Harvesting (OAI2)
  • Digital Archiving Policy
  • Promote your Publication
  • About the Journal
  • Call for Papers
  • Submit your Paper
  • Current Issue
  • Apply as a Reviewer
  • Indexing & Archiving

Special Issues

  • Guest Editors

Future of Information and Communication Conference (FICC)

  • Submit your Paper/Poster

Computing Conference

Intelligent Systems Conference (IntelliSys)

Future Technologies Conference (FTC)

DOI: 10.14569/IJACSA.2021.0120886 PDF

A Systematic Review Web Content Mining Tools and its Applications

Author 1: Manjunath Pujar Author 2: Monica R Mundada

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 12 Issue 8, 2021.

  • Abstract and Keywords
  • How to Cite this Article
  • {} BibTeX Source

Abstract: In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured information that impacts the day to day life of the society. Because of such availability of huge information, utilization of the required information becomes more challenging. This paper provided a comprehensive survey on the current situation and recent trends on web content mining (WCM) and its applications thereby contributing to the enhancement of the upcoming research in WCM. The paper focused mainly on the mining and retrieval techniques, various WCM approaches, challenges and process of information retrieval and information extraction. The paper describes the four major tasks of web content mining that is information retrieval, information extraction, generalization and validation in detail. WCM concentrates on orchestrating, sorting, classifying, collecting, congregating of web data and provide the improved data which can be easily accessed by the users. Web content mining tools were needed to scan text, images and HTML documents and provide results to the search engine. It guides the search engine to provide better productive results for every search based on their importance. The paper also analysed different web content mining tools for the extraction of relevant information from the corresponding web page.

Manjunath Pujar and Monica R Mundada, “A Systematic Review Web Content Mining Tools and its Applications” International Journal of Advanced Computer Science and Applications(IJACSA), 12(8), 2021. http://dx.doi.org/10.14569/IJACSA.2021.0120886

@article{Pujar2021, title = {A Systematic Review Web Content Mining Tools and its Applications}, journal = {International Journal of Advanced Computer Science and Applications}, doi = {10.14569/IJACSA.2021.0120886}, url = {http://dx.doi.org/10.14569/IJACSA.2021.0120886}, year = {2021}, publisher = {The Science and Information Organization}, volume = {12}, number = {8}, author = {Manjunath Pujar and Monica R Mundada} }

Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

IJACSA

Upcoming Conferences

web mining research papers 2021 pdf

Future of Information and Communication Conference (FICC) 2024

28-29 April 2025

  • Berlin, Germany

web mining research papers 2021 pdf

Computing Conference 2024

11-12 July 2024

  • London, United Kingdom

web mining research papers 2021 pdf

IntelliSys 2024

5-6 September 2024

  • Amsterdam, The Netherlands

web mining research papers 2021 pdf

Future Technologies Conference (FTC) 2024

14-15 November 2024

web mining research papers 2021 pdf

Login to your account

Change password, your password must have 8 characters or more and contain 3 of the following:.

  • a lower case character, 
  • an upper case character, 
  • a special character 

Password Changed Successfully

Your password has been changed

Create a new account

Can't sign in? Forgot your password?

Enter your email address below and we will send you the reset instructions

If the address matches an existing account you will receive an email with instructions to reset your password

Request Username

Can't sign in? Forgot your username?

Enter your email address below and we will send you your username

If the address matches an existing account you will receive an email with instructions to retrieve your username

World Scientific

  • This Journal
  •   
  • Institutional Access

Cookies Notification

Our site uses javascript to enchance its usability. you can disable your ad blocker or whitelist our website www.worldscientific.com to view the full content., select your blocker:, adblock plus instructions.

  • Click the AdBlock Plus icon in the extension bar
  • Click the blue power button
  • Click refresh

Adblock Instructions

  • Click the AdBlock icon
  • Click "Don't run on pages on this site"

uBlock Origin Instructions

  • Click on the uBlock Origin icon in the extension bar
  • Click on the big, blue power button
  • Refresh the web page

uBlock Instructions

  • Click on the uBlock icon in the extension bar

Adguard Instructions

  • Click on the Adguard icon in the extension bar
  • Click on the toggle next to the "Protection on this website" text

Brave Instructions

  • Click on the orange lion icon to the right of the address bar
  • Click the toggle on the top right, shifting from "Up" to "Down

Adremover Instructions

  • Click on the AdRemover icon in the extension bar
  • Click the "Don’t run on pages on this domain" button
  • Click "Exclude"

Adblock Genesis Instructions

  • Click on the Adblock Genesis icon in the extension bar
  • Click on the button that says "Whitelist Website"

Super Adblocker Instructions

  • Click on the Super Adblocker icon in the extension bar
  • Click on the "Don’t run on pages on this domain" button
  • Click the "Exclude" button on the pop-up

Ultrablock Instructions

  • Click on the UltraBlock icon in the extension bar
  • Click on the "Disable UltraBlock for ‘domain name here’" button

Ad Aware Instructions

  • Click on the AdAware icon in the extension bar
  • Click on the large orange power button

Ghostery Instructions

  • Click on the Ghostery icon in the extension bar
  • Click on the "Trust Site" button

Firefox Tracking Protection Instructions

  • Click on the shield icon on the left side of the address bar
  • Click on the toggle that says "Enhanced Tracking protection is ON for this site"

Duck Duck Go Instructions

  • Click on the DuckDuckGo icon in the extension bar
  • Click on the toggle next to the words "Site Privacy Protection"

Privacy Badger Instructions

  • Click on the Privacy Badger icon in the extension bar
  • Click on the button that says "Disable Privacy Badger for this site"

Disconnect Instructions

  • Click on the Disconnect icon in the extension bar
  • Click the button that says "Whitelist Site"

Opera Instructions

  • Click on the blue shield icon on the right side of the address bar
  • Click the toggle next to "Ads are blocked on this site"

CONNECT Login Notice

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Web mining: a survey of current research, techniques, and software.

  • QINGYU ZHANG  and 
  • RICHARD S. SEGALL

Department of Computer & Information Technology, Arkansas State University, State University, Arkansas 72467-0130, USA

Corresponding author.

Search for more papers by this author

The purpose of this paper is to provide a more current evaluation and update of web mining research and techniques available. Current advances in each of the three different types of web mining are reviewed in the categories of web content mining, web usage mining, and web structure mining. For each tabulated research work, we examine such key issues as web mining process, methods/techniques, applications, data sources, and software used. Unlike previous investigators, we divide web mining processes into the following five subtasks: (1) resource finding and retrieving, (2) information selection and preprocessing, (3) patterns analysis and recognition, (4) validation and interpretation, and (5) visualization. This paper also reports the comparisons and summaries of selected software for web mining. The web mining software selected for discussion and comparison in this paper are SPSS Clementine, Megaputer PolyAnalyst, ClickTracks by web analytics, and QL2 by QL2 Software Inc. Applications of these selected web mining software to available data sets are discussed together with abundant presentations of screen shots, as well as conclusions and future directions of the research.

  • web content mining
  • web usage mining
  • web structure mining
  • web mining software
  • L. Abraham and V. Ramos , Web usage mining using artificial ant colony clustering and genetic programming , CEC'03 — Congress on Evolutionary Computation (IEEE Press, Canberra, Australia, 2003 ) pp. 1384–1391. Google Scholar
  • N. Barsagade, Web usage mining and pattern discovery: A survey paper, Computer Science and Engineering Dept., CSE Tech Report 8331 (Southern Methodist University, Dallas, Texas, USA, 2003) . Google Scholar
  • S.   Chakrabarti , Mining the Web: Discovering Knowledge from Hypertext Data ( Elsevier Science & Technology Books , 2003 ) . Google Scholar
  • R. Chau , C. Yeh and K. Smith , Personalized multilingual web content mining , KES ( 2004 ) pp. 155–163. Google Scholar
  • L. Chen, W. Lian and W. Chue, Inform. Process. Management: Int. J.   41 (5), 1225 (2005). Web of Science ,  Google Scholar
  • T. S.   Clendaniel , The Data Administration Newsletter ( R.S. Seiner, Publisher , 2002 ) ,   http://www.tdan.com/i019fe02.htm . Google Scholar
  • R. Cooley, ACM Trans. Internet Tech.   3 (2), 93 (2003), DOI: 10.1145/767193.767194 . Google Scholar
  • R.   Cooley , B.   Mobasher and J.   Srivastava , Web mining: Information and pattern discovery on the world wide web , ICTAI ( 1997 ) , DOI: 10.1109/TAI.1997.632303 . Google Scholar
  • R. Cooley, B. Mobasher and J. Srivastava, Data preparation for mining worldwide browsing patterns (2003), http://maya.cs.depaul.edu/~classes/ect584/papers/cms-kais.pdf . Google Scholar
  • S. Cyrus and F. Banaei-Kashani, INFORMS J. Comput. Special Issue on Data Mining   15 (2), 123 (2003). Web of Science ,  Google Scholar
  • J. Darmont, O. Boussaid and F. Bentayeb, Warehousing Web Data (2007), http://www.arxiv.org/ftp/arxiv/papers/0705/0705.1456.pdf . Google Scholar
  • O. Etzioni, Commun. ACM   39 (11), 65 (1996), DOI: 10.1145/240455.240473 . Web of Science ,  Google Scholar
  • X. Fang and O. Sheng, ACM Trans. Internet Tech.   4 (2), 209 (2004), DOI: 10.1145/990301.990306 . Google Scholar
  • K. Fenstermacher and M. Ginsburg, J. Am. Soc. Inform. Sci. Tech.   54 (7), 625 (2003), DOI: 10.1002/asi.10255 . Web of Science ,  Google Scholar
  • J. Furnkranz, ÖGAI-J.   21 (2), 17 (2002). Web of Science ,  Google Scholar
  • S.   Graves et al. , Deployable suite of data mining web services for online science data repositories , 23rd Conf. IIPS, 87th American Meteorological Society Annual Meeting . Google Scholar
  • D. Greening, Data mining on the web: There's gold in that mountain of data (2006) http://www.webtechniques.com/archives/2000/01/greening/ . Google Scholar
  • S. Guan and P. McMullen, Int. J. Inform. Technol. Decision Making   4 (1), 97 (2005), DOI: 10.1142/S0219622005001404 . Link , Web of Science ,  Google Scholar
  • J. Han and C. Chang, Computer  54 (2002),  http://www-faculty.cs.uiuc.edu/~hanj/pdf/computer02.pdf . Google Scholar
  • B. Hay, G. Wets and K. Vanhoof, Knowledge Inform. Syst.   6 (2), 150 (2004). Web of Science ,  Google Scholar
  • A. Joshi, Web mining (2001), http://www.cs.umbc.edu/~ajoshi/web_mine . Google Scholar
  • A. Joshi, Web/data mining and personalization, University of Maryland Baltimore County (UMBC) eBiquity Research Area (2001), http://ebiquity.umbc.edu/project/html/id/17/Web-Data-Mining-and-Personalization . Google Scholar
  • D. Kanellopoulos and S. Kotsiantis, Int. Rev. Comput. Software   3 (1), 428 (2001). Google Scholar
  • P. Kolari and A. Joshi, Comput. Sci. Eng.  42 (2004), DOI: 10.1109/MCSE.2004.23 . Google Scholar
  • R. Kosala and H. Blockeel, ACM SIGKDD Explor.   2 , 1 (2000), DOI: 10.1145/360402.360406 . Google Scholar
  • K. Lau et al. , J. Database Marketing Customer Strategy Management   12 (1), 32 (2004), DOI: 10.1057/palgrave.dbm.3240241 . Google Scholar
  • J. W. Liang, Introduction to text and web mining, Seminar at North Carolina Technical University (2003), http://www.database.cis.nctu.edu.tw/seminars/2003F/TWM/slides/p.ppt . Google Scholar
  • B. Liu, Web content mining (2005), http://www.cs.uic.edu/~liub/WebContent-Mining.html . Google Scholar
  • C. Liu, Web content mining (29 November 2006), CM SIGKDD Webcast . Google Scholar
  • B.   Liu , Web Data Mining: Exploring Hyperlinks, Contents and Usage Data ( Springer Verlag Press , 2007 ) . Google Scholar
  • B. Liu and K. Chang, SIGKDD Explorations   6 (2), 1 (2004), DOI: 10.1145/1046456.1046457 . Google Scholar
  • Z.   Markov and D. T.   Larose , New Report "Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage" Features Web Structure Mining, Web Content Mining and Web Usage Mining ( John Wiley and Sons , 2007 ) ,   http://biz.yahoo.com/bw/070525/20070525005213.html?.v=1 . Google Scholar
  • Megaputer Intellegence Inc., WebAnalyst architecture (2007), http://www.megputer.com/products/wa/architechture.php3 . Google Scholar
  • B. Mobasher, Web data mining for business intelligence, ECT 584, DePaul University, Chicago, IL (2007), http://maya.cs.depaul.edu/~classes/ect584/papers/mobasher.pdf . Google Scholar
  • B. Mobasher, R. Colley and J. Srivastava, Commun. ACM   43 (8), 142 (2000), DOI: 10.1145/345124.345169 . Web of Science ,  Google Scholar
  • B.   Mobasher et al. , Integrating web usage and content mining for more effective personalization , EC-Web ( 2000 ) . Google Scholar
  • Z. Pabarskaite and A. Raudys, J. Intell. Inform. Syst.   28 (1), 79 (2007), DOI: 10.1007/s10844-006-0004-1 . Web of Science ,  Google Scholar
  • S. Palmer, The semantic web: An introduction (2001), http://infomesh.net/2001/swintro/ . Google Scholar
  • D. Pierrakos et al. , User Model. User-Adapt. Interact.   13 (4), 311 (2003), DOI: 10.1023/A:1026238916441 . Web of Science ,  Google Scholar
  • QL2 software (2007), http://www.ql2.com, viewed 5 June 2007 . Google Scholar
  • A. Scime, J. Intell. Inform. Syst.   22 (3), 211 (2004), DOI: 10.1023/B:JIIS.0000019288.63141.e4 . Web of Science ,  Google Scholar
  • A.   Scime , Web Mining: Applications and Techniques ( Idea Group Publishing , Hershey, PA , 2005 ) . Google Scholar
  • Semantic Web Agreement Group, What is the semantic web? (2001) http://swag.webns.net/WhatIsSW . Google Scholar
  • K. A. Smith and A. Ng, Decision Support Syst.   35 (2), 245 (2003), DOI: 10.1016/S0167-9236(02)00109-4 . Web of Science ,  Google Scholar
  • Q. Song and M. Shepperd, Comput. Indus.   57 (7), 622 (2006), DOI: 10.1016/j.compind.2005.11.006 . Web of Science ,  Google Scholar
  • M. Spiliopoulou, Commun. ACM   43 (8), 127 (2000), DOI: 10.1145/345124.345167 . Web of Science ,  Google Scholar
  • SPSS, Web mining for Clementine (2007) http://www.spss.com/web_mining_for_clementine, viewed 16 May 2007 . Google Scholar
  • J. Srivastava et al. , SIGKDD Explor.   1 (2), 12 (2000), DOI: 10.1145/846183.846188 . Google Scholar
  • W3Schools, Semantic web tutorial (2008) http://www.w3schools.com/semweb/default.asp . Google Scholar
  • Web Analytics (2007) http://www.clicktracks.com/, viewed as 25 October 2007 . Google Scholar
  • Wikipedia, Web mining (2007) http://en.wikipedia.org/wiki/Web_mining . Google Scholar
  • Wikipedia, Semantic web (2008) http://en.wikipedia.org/wiki/Semantic_Web . Google Scholar
  • Q. Yang and X. Wu, Int. J Inform. Technol. Decision Making   5 (4), 597 (2006), DOI: 10.1142/S0219622006002258 . Link , Web of Science ,  Google Scholar
  • A.   Zanasi et al. , Data mining VII: Data, text and web mining and their business applications (Data Mining and Information Engineering 2006) , Transactions on Information and Communication Technologies   37 ( 2006 ) . Google Scholar
  • H. Zhang et al. , World Wide Web   6 (2), 131 (2003), DOI: 10.1023/A:1023618504691 . Web of Science ,  Google Scholar
  • N. Zhong, Int. J. Inform. Technol. Decision Making   5 (4), 713 (2006), DOI: 10.1142/S0219622006002283 . Link , Web of Science ,  Google Scholar
  • G. Adomavicius and A. Tuzhilin, Data Mining Knowledge Discov.   5 , 33 (2001), DOI: 10.1023/A:1009839827683 . Web of Science ,  Google Scholar
  • R. Amarasiri and D. Alahakoon, Neural Comput. Appl.   13 , 149 (2004). Web of Science ,  Google Scholar
  • R. Boncella, Commun. Assoc. Inform. Syst.   12 , 327 (2005). Web of Science ,  Google Scholar
  • M. Ceci and D. Malerba, J. Intell. Inform. Syst.   28 , 37 (2007), DOI: 10.1007/s10844-006-0003-2 . Web of Science ,  Google Scholar
  • M. Chau et al. , Decision Support Syst.   35 , 167 (2003), DOI: 10.1016/S0167-9236(02)00103-3 . Web of Science ,  Google Scholar
  • R. Chau and C. Yeh, Neural Comput. Appl.   13 , 140 (2004). Web of Science ,  Google Scholar
  • Z. Chen, A. Fu and F. Tong, World Wide Web   6 , 259 (2003), DOI: 10.1023/A:1024606901978 . Web of Science ,  Google Scholar
  • R. Chen, K. Sivakumar and H. Kargupta, Knowledge Inform. Syst.   6 , 164 (2004). Web of Science ,  Google Scholar
  • S. Dustdar and R. Gombotz, Int. J. Business Process Integrat. Management   1 , 256 (2007), DOI: 10.1504/IJBPIM.2006.012624 . Google Scholar
  • M. Eirinaki and M. Vazirgiannis, ACM Trans. Internet Tech.   3 , 1 (2003), DOI: 10.1145/643477.643478 . Web of Science ,  Google Scholar
  • B. Ezeife and Y. Lu, Data Mining Knowledge Discov.   10 , 5 (2005), DOI: 10.1007/s10618-005-0248-3 . Web of Science ,  Google Scholar
  • F. Faccaand and P. Lanzi, Data Knowledge Eng.   53 , 225 (2005). Web of Science ,  Google Scholar
  • S. Flesca et al. , World Wide Web   8 , 317 (2005), DOI: 10.1007/s11280-005-1315-9 . Web of Science ,  Google Scholar
  • P. Giudici and R. Castelo, Data Mining Knowledge Discov.   5 , 183 (2001), DOI: 10.1023/A:1011469000311 . Web of Science ,  Google Scholar
  • B. Gregg and S. Walczak, Commun. ACM   49 , 78 (2006), DOI: 10.1145/1125944.1125945 . Web of Science ,  Google Scholar
  • W. Grossmann, M. Hudec and R. Kurzawa, Int. J. Electron. Business   2 , 480 (2004), DOI: 10.1504/IJEB.2004.005881 . Web of Science ,  Google Scholar
  • H. Han and R. Elmasri, J. Intell. Inform. Syst.   22 , 237 (2004), DOI: 10.1023/B:JIIS.0000019278.84222.b7 . Web of Science ,  Google Scholar
  • B. Haruechaiyasak and M. Shyu, Int. J. Comput. Appl. Tech.   27 , 298 (2007), DOI: 10.1504/IJCAT.2006.012000 . Web of Science ,  Google Scholar
  • B. Huang and T. Chou, Electron. Commerce Res. Appl.   3 , 266 (2004), DOI: 10.1016/j.elerap.2004.04.002 . Google Scholar
  • X. Huang et al. , J. Am. Soc. Inform. Sci. Tech.   56 , 1290 (2004). Google Scholar
  • X. Jiang, Inform. Software Tech.   46 , 55 (2004). Web of Science ,  Google Scholar
  • K. Joshi, A. Joshi and Y. Yesha, Distributed Parallel Databases   13 , 161 (2003), DOI: 10.1023/A:1021515408295 . Web of Science ,  Google Scholar
  • Kdnuggets, Software: Web Mining and Web Usage Mining, http://www.kdnuggets.com/software/web-mining.html . Google Scholar
  • D. Kim, H. Jung and L. Geunbae, Inform. Process. Manag.   39 , 623 (2003), DOI: 10.1016/S0306-4573(03)00004-9 . Web of Science ,  Google Scholar
  • Y. Kotb, K. Gondow and T. Katayama, J. Int. Inform. Syst.   22 , 257 (2004), DOI: 10.1023/B:JIIS.0000019279.81927.c5 . Web of Science ,  Google Scholar
  • Y. Kuo and L. Chen, Expert Syst. Appl.   21 , 203 (2001), DOI: 10.1016/S0957-4174(01)00040-9 . Web of Science ,  Google Scholar
  • C. Lee, Y. Kim and P. Rhee, Expert Syst. Appl.   21 , 131 (2001), DOI: 10.1016/S0957-4174(01)00034-3 . Web of Science ,  Google Scholar
  • A. Maguitman et al. , World Wide Web   9 , 431 (2006), DOI: 10.1007/s11280-006-8562-2 . Web of Science ,  Google Scholar
  • A. Nanopoulos and Y. Manolopoulos, Data Knowledge Eng.   37 , 243 (2001), DOI: 10.1016/S0169-023X(01)00008-8 . Web of Science ,  Google Scholar
  • Z. Pabatsjaute, Intll. Data Anal.   7 , 141 (2003). Web of Science ,  Google Scholar
  • D. Roussinov and F. Zhao, J. Enterprise Inform. Management   17 , 424 (2004), DOI: 10.1108/17410390410566715 . Google Scholar
  • B. Sakkopoulos, D. Kanellopoulos and A. Tsakalidis, Int. J. Metadata Semant. Ontol.   1 , 66 (2006), DOI: 10.1504/IJMSO.2006.008771 . Google Scholar
  • M. Shyu, C. Haruechaiyasak and S. Chen, Knowledge Inform. Syst.   10 , 515 (2006), DOI: 10.1007/s10115-006-0004-z . Web of Science ,  Google Scholar
  • K. Smith and A. Ng, Decision Support Syst.   35 , 245 (2003), DOI: 10.1016/S0167-9236(02)00109-4 . Web of Science ,  Google Scholar
  • P. Smyth, D. Pregibon and C. Faloutsos, Commun. ACM   45 , 33 (2002). Google Scholar
  • A.   Thuraisingham , Web Data Mining and Applications in Business Intelligence and Counter-Terrorism ( CRC Press , 2004 ) . Google Scholar
  • L. van Wel and R. Lambèr, Ethics Inform. Tech.   6 , 129 (2004), DOI: 10.1023/B:ETIN.0000047476.05912.3d . Web of Science ,  Google Scholar
  • Y. Wu and A. Chen, World Wide Web   5 , 67 (2002), DOI: 10.1023/A:1015750423727 . Google Scholar
  • Q. Yang, T. Li and K. Wang, Data Mining Knowledge Discov.   8 , 253 (2004), DOI: 10.1023/B:DAMI.0000023675.04946.f1 . Web of Science ,  Google Scholar
  • Q. Yang, T. Li and K. Wang, Appl. Artific. Intell.   17 , 431 (2003), DOI: 10.1080/713827182 . Web of Science ,  Google Scholar
  • Q. Yang and H. Zhang, World Wide Web   4 , 299 (2001), DOI: 10.1023/A:1015185802583 . Google Scholar
  • A. Zhang and Y. Dong, Comput. Networks   39 , 303 (2002), DOI: 10.1016/S1389-1286(02)00211-6 . Web of Science ,  Google Scholar
  • Trending Topics of Malaysia Through Social Media Analytics Shaunvinder Singh and Rajasvaran Logeswaran 4 Dec 2023
  • Federated Few-shot Learning Song Wang, Xingbo Fu, Kaize Ding, Chen Chen and Huiyuan Chen et al. 4 August 2023
  • Web Mining Petar Ristoski 26 February 2023
  • Role of Natural Language Processing for Text Mining of Education Policy in Rajasthan Pooja Jain and Shobha Lal 22 September 2023
  • Efficient algorithms for deriving complete frequent itemsets from frequent closed itemsets Cheng-Wei Wu, JianTao Huang, Yun-Wei Lin, Chien-Yu Chuang and Yu-Chee Tseng 11 April 2021 | Applied Intelligence, Vol. 52, No. 6
  • Using Machine Learning to Compare the Information Needs and Interactions of Facebook: Taking Six Retail Brands as an Example Yulin Chen 17 December 2021 | Information, Vol. 12, No. 12
  • RETRACTED ARTICLE: A Sly Salvage of Semantic Web Content with Insistence of Low Precision and Low Recall R. Bhavani, V. Prakash and K. Chitra 6 February 2020 | Wireless Personal Communications, Vol. 117, No. 4
  • Swarm Intelligence and cyber-physical systems: Concepts, challenges and future trends Melanie Schranz, Gianni A. Di Caro, Thomas Schmickl, Wilfried Elmenreich and Farshad Arvin et al. 1 Feb 2021 | Swarm and Evolutionary Computation, Vol. 60
  • Andrea Ko  and 
  • Saira Gillani
  • Semantic Web mining for Content-Based Online Shopping Recommender Systems Ibukun Tolulope Afolabi, Opeyemi Samuel Makinde and Olufunke Oyejoke Oladipupo 1 Oct 2019 | International Journal of Intelligent Information Technologies, Vol. 15, No. 4
  • The Evolution of Trends and Techniques used for Data Mining Shahzad Nazir, Muhammad Asif and Shahbaz Ahmad 1 Feb 2019
  • An Efficient Algorithm for Deriving Frequent Itemsets from Lossless Condensed Representation JianTao Huang, Yi-Pei Lai, Chieh Lo and Cheng-Wei Wu 15 June 2019
  • Design and Implementation of Secured E-Business Structure with LTL Patterns for User Behavior Prediction Ayman Mohamed Mostafa 5 December 2019
  • Towards an Evolved Information Food Chain of World Wide Web and Taxonomy of Semantic Web Mining Priyanka Bhutani and Anju Saha 20 November 2018
  • Weblog Data Structuration Amine Ganibardi and Chérif Arab Ali 19 November 2018
  • A Genetics Clustering-Based Approach for Weblog Data Cleaning Amine Ganibardi and Cherif Arab Ali 1 Oct 2018
  • Web Usage Data Cleaning Amine Ganibardi and Chérif Arab Ali 8 August 2018
  • A survey paper on techniques and applications of web usage mining Subhi Jain, Ruchira Rawat and Bina Bhandari 1 Nov 2017
  • Automating the Extraction of Static Content and Dynamic Behaviour from e-Commerce Websites Joaõ Pedro Dias and Hugo Sereno Ferreira 1 Jan 2017 | Procedia Computer Science, Vol. 109
  • Analysis of Users’ Behavior in Structured e-Commerce Websites Sergio Hernandez, Pedro Alvarez, Javier Fabra and Joaquin Ezpeleta 1 Jan 2017 | IEEE Access, Vol. 5
  • Contribution to ontologies building using the semantic web and web mining Mohamed El Asikri, Jalal Laassiri, Salah-ddine Krit and Hassan Chaib 1 Sep 2016
  • A comprehensive study on the effects of using data mining techniques to predict tie strength Mohammad Karim Sohrabi and Soodeh Akbari 1 Jul 2016 | Computers in Human Behavior, Vol. 60
  • Classifier and feature set ensembles for web page classification Aytuğ Onan 29 June 2015 | Journal of Information Science, Vol. 42, No. 2
  • Application and Significance of Web Usage Mining in the 21st Century: A Literature Review M. Aldekhail 1 February 2016 | International Journal of Computer Theory and Engineering, Vol. 8, No. 1
  • An Automated Approach for Requirements Specification Maintenance J. Esparteiro Garcia and Ana C. R. Paiva 2 March 2016
  • Social Media Mining Vipul Gupta and Mayank Gupta 1 Jan 2016
  • Real-time web mining application to support decision-making process Jan Hovad, Martin Lnenicka and Jitka Komarkova 1 Jul 2015
  • Artificial Immune System Based Web Page Classification Aytuğ Onan 1 Jan 2015
  • A Proposal of a Big Web Data Application and Archive for the Distributed Data Processing with Apache Hadoop Martin Lnenicka, Jan Hovad and Jitka Komarkova 24 October 2015
  • Willingness to Be Financially Informed and the Benefits of Nudging Investors to Do So Remo Sttssel 1 Jan 2015 | SSRN Electronic Journal, Vol. 2
  • Webometrics benefitting from web mining? An investigation of methods and applications of two research fields David Gunnarsson Lorentzen 9 January 2014 | Scientometrics, Vol. 99, No. 2
  • Hsin-Chang Yang  and 
  • Chung-Hong Lee
  • Parallel data mining techniques on Graphics Processing Unit with Compute Unified Device Architecture (CUDA) Liheng Jian, Cheng Wang, Ying Liu, Shenshen Liang and Weidong Yi et al. 26 August 2011 | The Journal of Supercomputing, Vol. 64, No. 3
  • A Comprehensive Survey on Web Content Extraction Algorithms and Techniques Sumaia Mohammed Al-Ghuribi and Saleh Alshomrani 1 Jun 2013
  • RASIM M. ALGULIEV , 
  • RAMIZ M. ALIGULIYEV , and 
  • NIJAT R. ISAZADE
  • A devised framework for content recommendation system using collaborative log mining C. R. Varnagar, N. N. Madhak, T. M. Kodinariya and R. S. Agrawal 1 Mar 2013
  • Web usage mining: A review on process, methods and techniques C. R. Varnagar, N. N. Madhak, T. M. Kodinariya and J. N. Rathod 1 Feb 2013
  • Web data mining trends and techniques Ujwala Manoj Patil and J. B. Patil 3 August 2012
  • Multiple factor hierarchical clustering algorithm for large scale web page and search engine clickstream data Gang Kou and Chunwei Lou 14 February 2010 | Annals of Operations Research, Vol. 197, No. 1
  • Research on mobile learning system based on Web mining Zhengqiao Xu and Dewei Zhao 1 Jul 2012
  • External semantic annotation of web-databases Benjamin Donz and Dietmar Bruckner 1 May 2012
  • Web Mining and Security in E-commerce Shaikh Mohammed Atiq, Dayanand Ingle and B. B. Meshram 1 Jan 2012
  • XIANYONG FANG , 
  • CHRISTIAN JACQUEMIN , and 
  • FRÉDÉRIC VERNIER
  • Web Data Mining research: A survey Brijendra Singh and Hemant Kumar Singh 1 Dec 2010
  • JUNLIN ZHOU , 
  • ALEKSANDAR LAZAREVIC , 
  • KUO-WEI HSU , 
  • JAIDEEP SRIVASTAVA , 
  • YAN FU , and 
  • LAN ZHAO  and 
  • JOSÉ BORGES  and 
  • MARK LEVENE
  • UNIL YUN  and 
  • KEUN HO RYU
  • LI-AN HO , 
  • TSUNG-HSIEN KUO , 
  • CHINHO LIN , and 
  • BINSHAN LIN
  • Review of data, text and web mining software Qingyu Zhang and Richard S. Segall 4 May 2010 | Kybernetes, Vol. 39, No. 4
  • ANTONIO CABALLERO-LUQUE , 
  • PABLO ARAGONÉS-BELTRÁN , 
  • MÓNICA GARCÍA-MELÓN , and 
  • CARLOS DEMA-PÉREZ
  • Credit card customer analysis based on panel data clustering Guangli Nie, Yibing Chen, Lingling Zhang and Yuhong Guo 1 May 2010 | Procedia Computer Science, Vol. 1, No. 1
  • Commercial Data Mining Software Qingyu Zhang and Richard S. Segall 7 July 2010
  • Maximal Sequential Patterns René Arnulfo García-Hernández, J. Fco. Martínez-Trinidad and J. Ariel Carrasco-Ochoa

Recommended

Journal cover image

IEEE Account

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

  • DOI: 10.18535/IJECS/V6I1.29
  • Corpus ID: 63876488

A Study on Web Content Mining

  • Anurag Kumar
  • Published 12 January 2017
  • Computer Science
  • International Journal of Engineering and Computer Science

13 Citations

A survey on web mining techniques, data mining: web data mining techniques, tools and algorithms: an overview.

  • Highly Influenced

Web Mining for Information Retrieval

Web mining algorithms, a survey on web mining: techniques and applications, a systematic review web content mining tools and its applications, web page ranking using web mining techniques: a comprehensive survey, content based page ranking by using some natural language processing techniques, web content outlier mining using machine learning and mathematical approaches, semantic web mining for content-based online shopping recommender systems, 16 references, web mining: information and pattern discovery on the world wide web, web content mining techniques-a comprehensive survey, web mining research: a survey, web content mining tools: a comparative study, web mining: a survey of current research, techniques, and software, overview of visualization tools for web browser history data, data mining: introductory and advanced topics, authoritative sources in a hyperlinked environment, related papers.

Showing 1 through 3 of 0 Related Papers

web mining research papers 2021 pdf

Current Journal of Applied Science and Technology

Published: 2023-08-14

DOI: 10.9734/cjast/2023/v42i244179

Page: 32-42

Issue: 2023 - Volume 42 [Issue 24]

Review Article

Exploring the Landscape of Web Data Mining: An In-depth Research Analysis

Laxmi Choudhary *

Computer Science, Sabarmati University, Ahmedabad, India.

Shashank Swami

Department of Computer Science, Sabarmati University, Ahmedabad, India.

*Author to whom correspondence should be addressed.

The exponential growth of Web services and Web-based applications has led to an enormous volume of data, providing a rich source for mining valuable insights. Web mining differs from traditional data mining due to the unique nature of the data it handles. Web data exists in diverse forms, including web server logs, news pages, and hyperlinks. As the usage of the internet continues to surge, web mining has become essential to extract meaningful information and patterns from these varied data sources. Traditional data mining methods may not be directly applicable to web data due to its unstructured and heterogeneous nature. Web server logs contain valuable information about user interactions, click-streams, and user preferences, which can be mined to understand user behavior and improve website performance. News pages and other forms of web content are valuable sources for sentiment analysis, topic modeling, and information retrieval, helping businesses and researchers gain insights into public opinions and trends. Additionally, web structure mining deals with the analysis of hyperlinks, enabling the discovery of relationships between web pages and identifying authoritative sources. The continuous growth of web-based data necessitates the use of specialized methods in web mining to effectively extract knowledge and valuable patterns. Researchers and practitioners in this field are constantly exploring innovative techniques to make sense of the vast amount of data available on the World Wide Web. The paper provides web mining techniques on web data and presenting the latest advancements, researchers and practitioners can gain insights into the state of the field and identify potential areas for further exploration. This paper also reports the comparisons and summary of various methods of web data mining with applications, which gives the overview of development in research and some important research issues.

Keywords: Information retrieval, semantic web, text mining, web crawling, web mining, web content mining, web data mining, web structure mining, web usage mining

How to Cite

  • Download Citation
  • Endnote/Zotero/Mendeley (RIS)

Margaret H. Dunham, ―Data Mining Introductory & Advanced Topics‖, Pearson Education.

Mustafa Ali Bamboat, Ghulam Sarfaraz Khan, Naadiya Mirbahar, Sheeba Memon, “Web Content Mining Techniques for Structured Data: A Review” ( SJHSE) Sindh Journal of Headways in Software Engineering. 2022;1(1).

Richlin Selina Jebakumari A. Nancy Jasmine Goldena. A Survey on Web Content Mining Methods and Applications for Perfect Catch Responses. International Research Journal of Engineering and Technology (IRJET). 2019;06(01): 407-412. e-ISSN: 2395-0056 p-ISSN: 2395-0072.

O Etzioni. The world wield web: Quagmire or Gold Mining. Communicate of the ACM. 1996;39:11:65-68.

Kosala and Blockeel, “Web mining research: A survey”, SIGKDD:SIGKDD Explorations: Newsletter of the Special Interest Group (SIG) on Knowledge Discovery and Data Mining, ACM. 2000;2.

Qingyu Zhang and Richard s. Segall, “Web mining: a survey of current research, Techniques, and software”, in the International Journal of Information Technology & Decision Making. 2008;7(4): 683–720.

Sharma PS, Yadav D, Thakur RN. Web Page Ranking Using Web Mining Techniques: A Comprehensive Survey”. In M. P. Kumar Reddy (Ed.), Mobile Information Systems. 2022;2022:1–19. Hindawi Limited. Available: https://doi.org/10.1155/2022/7519573 .

Kumar S, Kumar R. A Study on Different Aspects of Web Mining and Research Issues. In IOP Conference Series: Materials Science and Engineering. 2021;1022(1):012018. IOP Publishing.

Available: https://doi.org/10.1088/1757- 899x/1022/1/012018.

Andemariam Mebrahtu, Balu Srinivasulu. Web Content Mining Techniques and Tools;2017. IJSCMC

URL: https://www.ijcsmc.com/docs/papers/April2017/V6I4201725.pdf

Zhang H, Chen Z, Li M, Su Z. Relevance feedback and learning in content-based image search, World Wide Web. 2003;6(2):131–155.

Anil B. Pawar, Madhuri A. Jawale, Chaitanya P. Kale. A Powerful Techniques and Applications of Web Mining” in Intelligent Systems and Computer Technology. 269-276. DOI:10.3233/APC200153.

Wang Bo, Xu Jing. Research on Web Data Mining Hadoop Simulation Platform Based on Cloud Computing", Electronic Design Engineering. 2018;26(2): 22-25.

Chen L, Lian W, Chue W. Using web structure and summarization techniques for web content mining, Inform. Process. Management: Int. J. 2005;41(5):1225–1242.

Kavita, Mahani P, Ruhil N. Web data mining: A perspective of research issues and challenges. In international conference on computing for sustainable global development. 2016;3235-8. IEEE.

Yu-Hui Tao, Tzung-Pei Hong, Yu-Ming Su. Web usage mining with intentional browsing data” in international journal of Expert Systems with Applications. 2007;34:1893–1904.

Ramakrishna, Gowdar et al. Web Mining: Key Accomplishments, Applications and Future Directions”, in the International Conference on Data Storage and Data Engineering; 2010.

Furnkranz J. Web structure mining — exploiting the graph structure of the worldwide web, OGAI-J. 2002;21(2):17–26.

Nacim Fateh Chikhi, Bernard Rothenburger, Nathalie Aussenac-Gilles. A Comparison of Dimensionality Reduction Techniques for Web Structure Mining. Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence. 2007;116-119.

Singh B, Singh HK. Web data mining research: a survey. In international conference on computational intelligence and computing research. 2010;1-10. IEEE.

Sunil kumar T, Suvarchala K. A Study: Web Data Mining Challenges and Application for

Information Extraction, IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727. 2012;7(3):24-29.

Just J. A Short Survey of Web Data Mining WDS'13 Proceedings of Contributed Papers, Part I, 2013;59–62. ISBN 978-80-7378-250-4, MATFYZPRESS.

Jianhan Zhu, Jun Hong et al. Using Markov Models for Web Site Link Prediction” College Park, Maryland, USA ACM. 2002;11-15.

© Copyright 2010-Till Date, Current Journal of Applied Science and Technology. All rights reserved.

  • Skip to main content
  • Skip to search
  • Skip to footer

Products and Services

Two workers at a desk reviewing a Provider Connectivity Assurance dashboard

Cisco Provider Connectivity Assurance

Gain end-to-end visibility and insights like never before..

Create exceptional digital experiences built on deep network observability and critical network monitoring.

Service assurance that's proactive and precise—a win-win

Boost efficiency and revenue with Cisco Provider Connectivity Assurance (formerly Accedian Skylight), delivering service assurance that continuously monitors and optimizes digital experiences.

Simplified operations and troubleshooting

Gain a single view of granular performance metrics and third-party data to accelerate MTTR.

Continuous optimization of digital experiences

Find transient issues with precise synthetic network and service testing.

Differentiated services, next-level innovation

Premium, SLA-backed services and end-customer portals are just a click away.

Predictive analytics powered by AI

Be proactive, not reactive. Put the power of your multivendor infrastructure to work for a trouble-free network.

Drive additional sales revenue

See more revenue from portal fees and capacity purchases when you give customers deeper visibility and transparency with Cisco Provider Connectivity Assurance.

See what sets Cisco Provider Connectivity Assurance apart

Granular, real-time visibility.

Perform end-to-end and full mesh testing with microsecond precision and make hidden microbursts a thing of the past.

Reliability meets flexibility

Enjoy 99.997% uptime and take full control over network performance for greater reliability.

Optimize your infrastructure

Automatically correlate sensor metrics with third-party sources of performance data to improve ROI and create better digital experiences.

Deliver exceptional services

Ready-made customer portals differentiate services, boost revenue, and enhance satisfaction.

Assuring high-performance critical networks

See how Cisco Provider Connectivity Assurance enables efficient troubleshooting and SLA assurance—all while lowering operational costs.

Cisco Provider Connectivity Assurance use cases

Solve the challenges of fragmented multidomain tools and poor visibility on service quality. Discover how Provider Connectivity Assurance can help you simplify operations.

Real-time SLA visibility

Monitor and police SLAs for accountability, while proactively addressing performance issues.​

Critical network monitoring

Improve operational efficiency with proactive assurance and microsecond visibility for faster issue resolution and better experiences.

B2B service differentiation

Create new revenue opportunities to upsell and differentiate your services with multitenant end-customer portals for real-time service-level agreement (SLA) visibility and alerting. ​

Automated assurance

Detect issues early when you automate assurance for the entire service lifecycle with real-time service visibility and predictive analytics.​

Multilayer assurance

Get a single view of performance across multiple network layers, like segment routing and routed optical networking, to reduce tools and drive down MTTR.​

Mobile backhaul and edge monitoring

Optimize digital experiences with real-time service visibility, while assuring your end-to-end 5G transport.​

Get high-precision service assurance with Cisco Provider Connectivity Assurance

Ai-enabled predictive analytics.

Get an ML-assisted view of your entire network and gain powerful performance insights.

Assurance sensors

Network-wide service performance visibility. Deployable anywhere, at scale.

See why your peers trust Cisco Provider Connectivity Assurance

"we can look at network performance at any level.".

With Cisco Provider Connectivity Assurance, Bouygues Telecom now has a complete "telescopic and microscopic" view of network and service performance in a single tool.

André Ethier, Network Quality Engineer

Bouygues Telecom

"The solution is key in evolving our end-to-end network visibility."

This is extremely powerful in terms of customer experience. It helps to avoid tickets, as customers can see for themselves what happened with their service. This reduces tension and increases customer satisfaction.

Bart Janssens, Senior Specialist, Packet Architecture

Take preventative measures against network degradation.

"With service-centric assurance and granular visibility we can prevent degradations, automate actions for improvements, and better communicate with our customers."

Mahesh Anjan, Senior Product Technology Executive

Better together

Cisco crosswork network automation.

Drive network efficiency and enhance experiences.

Cisco Optics

Plug innovation into your network.

Boost efficiency and service quality

Cisco Provider Connectivity Assurance helps you improve service quality, lower costs, and deliver outstanding user experiences with a single view of service performance across the entire network

IMAGES

  1. (PDF) A Review Of Web Mining Research

    web mining research papers 2021 pdf

  2. Web Mining Research Survey PDF

    web mining research papers 2021 pdf

  3. (PDF) Web Mining Service (WMS), a public and free service for web data

    web mining research papers 2021 pdf

  4. (PDF) An overview of web mining in societal benefit areas

    web mining research papers 2021 pdf

  5. (PDF) Research on Web Data Mining

    web mining research papers 2021 pdf

  6. (PDF) Web Mining Overview, Taxonomy, Literature Survey: A Review

    web mining research papers 2021 pdf

VIDEO

  1. Web Mining Honest Review as of February 14, 2023

  2. Charla Magistral

  3. 5G × Shaanxi Mines: Smarter, Safer, More Efficient

  4. [KBI2015] "Text Mining Research at ITB"

  5. New Online Mining Website 2024 || Free Dogcoin Mining website

  6. Webinar "The Future of Mining Industry Through Technology Development"

COMMENTS

  1. 121116 PDFs

    Explore the latest full-text research PDFs, articles, conference papers, preprints and more on WEB MINING. Find methods information, sources, references or conduct a literature review on WEB MINING

  2. (PDF) Web Mining: A Survey of Current Research, Techniques, and

    Barsagade2 provides a survey paper on web mining usage and pattern discovery. Chau et al.4 discuss personalized multilingual web content mining. Kolari and Joshi24 provide an overview of past and current work in the three main areas of web mining research-content, structure, and usage as well as emerging work in semantic web mining.

  3. (PDF) Web Content Mining Techniques for Structured Data: A Review Web

    Web Mining is an extensive, interdisciplinary, and dynamic area; it consists of three techniques: Web Content Mining, Structured Web Mining, and Web Usage Mining. This paper gives an overview of ...

  4. [PDF] A Systematic Review Web Content Mining Tools and its Applications

    This paper provided a comprehensive survey on the current situation and recent trends on web content mining (WCM) and its applications thereby contributing to the enhancement of the upcoming research in WCM. In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of unstructured and structured ...

  5. PDF Text and Web Content Mining: A Systematic Review

    Text and Web Content Mining: A Systematic Review Fatima Almatrooshi1, Sumayya Alhammadi1, Said A. Salloum2,3(B), and Khaled Shaalan1 1 Faculty of Engineering and IT, The British University in Dubai, Dubai, UAE 2 School of Science, Engineering, and Environment, University of Salford, Salford, UK [email protected] 3 Machine Learning and NLP Research Group, Department of Computer Science,

  6. PDF A Systematic Review Web Content Mining Tools and its Applications

    WCM is utilized in various web applications with intension to identify web objects which have common patterns or characteristics [9, 10]. It is naturally semi-structure format of web. It has two kids: one type directly extracts document's content and another type enhance search of content with tools like search engine.

  7. RQ'LIIHUHQW$VSHFWVRI:HE0LQLQJDQG 5HVHDUFK,VVXHV

    1. A Study on Different Aspects of Web Mining and Research Issues. Santosh Kumar1, Ravi Kumar2. Department of Computer Science and Engineering ABES Engineering College, Ghaziabad, Uttar Pradesh, India-201009. [email protected], [email protected] Corresponding author email: [email protected], Abstract.

  8. Web Mining Research: A Survey

    The Web mining research is a converging research area from several research communities, such as database, IR, and AI research communities especially from machine learning and NLP. This paper is an attempt to put the research done in a more structured way from the machine learning point of view.

  9. PDF Web Mining

    21.1.1 Web Content Mining Web content mining is the process of extracting useful information from the contents of web documents. Content data is the collection of facts a web page is designed to contain. It may consist of text, images, audio, video, or struc-tured records such as lists and tables. Application of text mining to web con-tent has ...

  10. A Systematic Review Web Content Mining Tools and its Applications

    A Systematic Review Web Content Mining Tools and its Applications. International Journal of Advanced Computer Science and Applications (IJACSA), Volume 12 Issue 8, 2021. Abstract: In recent years, the emergence of WWW (World Wide Web) led to the accumulation of huge amount of information and data. Hence the web is found to consist of ...

  11. PDF Web Mining

    Web mining is an important tool for collecting behavior of web site visitors and thus allows for appropriate adjustments and decisions regarding real Web users and traffic patterns. Along with a description of the processes involved in Web mining claim that Web conversion, System Improvement, Web personalization and Business Intelligence are ...

  12. PDF International Journal of Applied Engineering and Management SRINIVAS

    Letters (IJAEML), ISSN: 2581-7000, Vol. 5, No. 1, June 2021 SRINIVAS PUBLICATION Rajeev Kumar, et al, (2021); www.srinivaspublication.com PAGE 106 A Framework of Web Mining Algorithm-Based Hidden Pattern in India E-Government Application Using Blockchain Technology Rajeev Kumar Post Doc Researcher, College of Engineering & Technology,

  13. Web Mining: a Survey of Current Research, Techniques, and Software

    The web mining software selected for discussion and comparison in this paper are SPSS Clementine, Megaputer PolyAnalyst, ClickTracks by web analytics, and QL2 by QL2 Software Inc. Applications of these selected web mining software to available data sets are discussed together with abundant presentations of screen shots, as well as conclusions ...

  14. (PDF) Trends in data mining research: A two-decade review using topic

    Address: 20, Myasnitskaya Street, Moscow 101000, Russia. Abstract. This work analyzes the intellectual structure of data mining as a scientific discipline. T o do this, we use. topic analysis ...

  15. PDF Web Mining: A Framework

    paper is to provide a more evaluative update of web mining research and techniques available. This paper, provide the reviews for concept of Web mining, the type of Web mining and different techniques used in each type. This paper discusses the current trends and challenges in this research area. Keywords— Clustering, Association rules, Pattern

  16. Research on Web Data Mining Based on Topic Crawler

    This paper analyzes the method of Web information data mining based on topic crawler. This paper puts forward the architecture of Web information search and data mining, and introduces the key technology and operation principle of the architecture. After analyzing the functions and shortcomings of ordinary crawler, this paper focuses on the working principle, implementation method and ...

  17. [PDF] A Study on Web Content Mining

    This paper deals with a preliminary discussion of Web content mining, contributions in the field of web mining, the prominent successful tools and algorithms, the prominent successful tools and algorithms. : Web Mining is extracting information from the web re-sources and finding interesting patterns that can be useful from ever expanding database of World Wide Web. Whenever we talk about data ...

  18. (PDF) Web Mining: A survey of current research, techniques, and software

    The web mining software selected for discussion and comparison in this paper are SPSS Clementine, Megaputer PolyAnalyst, ClickTracks by web analytics, and QL2 by QL2 Software Inc. Applications of ...

  19. Exploring the Landscape of Web Data Mining: An In-depth Research

    Wang Bo, Xu Jing. Research on Web Data Mining Hadoop Simulation Platform Based on Cloud Computing", Electronic Design Engineering. 2018;26(2): 22-25. Chen L, Lian W, Chue W. Using web structure and summarization techniques for web content mining, Inform. Process. Management: Int. J. 2005;41(5):1225-1242.

  20. PDF Web Mining

    Data Mining vs Web Mining Data Mining : It is a concept of identifying a significant pattern from the data that gives a better outcome. Web Mining : It is the process of performing data mining in the web. Extracting the web

  21. (PDF) Web Mining Research: A Survey

    With the huge amount of information available online, the World Wide Web is a fertile area for data mining research. The Web mining research is at the cross road of research from several research ...

  22. Cisco Provider Connectivity Assurance

    Get white paper. Take preventative measures against network degradation. "With service-centric assurance and granular visibility we can prevent degradations, automate actions for improvements, and better communicate with our customers." Mahesh Anjan, Senior Product Technology Executive. AT&T. Watch webinar on demand.

  23. (PDF) Web Data Mining research: A survey

    Abstract and Figures. Web Data Mining is an important area of Data Mining which deals with the extraction of interesting knowledge from the World Wide Web, It can be classified into three ...

  24. (PDF) Research on Web Data Mining

    Machine Learning or Data Mining techniques to learn. the extraction patterns or rules for Web documents. semi-automatically or automatically. Within this view, Web Mining is part of the (Web) IE ...