Web Content Mining with Java - Techniques for Exploiting the Worlds Biggest Information Resource
-
Author:
-
Subject:
-
Published by:John Wiley & Sons (UK)
-
Published:28/03/2002
-
Price:$92.99
- < Buy this book >
This practical book shows you how to build portals, construct search engines and other knowledge-based applications to mine the information you need from the Web.
* Written by a developer for developers
* A practical, hands-on approach
* Illustrates how Java associated tools (XML, HTML) can be combined with database technology to display and manipulate Web-derived information more effectively.
* Demonstrates how to build a structure browser, portal, meta-search engine and how to make 'Talking Pages'
Table of Contents
About the Author.
Acknowlegements.
Surveying the Scene
Language of the Web
HTML and XML Parsing
Data Filters and Structured Queries
Building a Portal with Java
Building a Search Engine with Java
Mail Mining with Java
Introduction to Text Mining
Introduction of Data Mining
Loose Ends and Looking Ahead
Appendix A: Software Installation and Configuration
Appendix B: Javadoc Extracts
Appendix C: Earlier Versions of JAXP
Appendix D: License and Copyright Statements
Appendix E: Census 1891Data XML
Appendix F: Share Price Cluster Data
Appendix G: Glossary of Acronyms
References
Further Reading
Index
- FTMobile Portal Architect - .Net TechnologiesNSW
- FTAccount Manager - Strategic Enterprise DevelopmentNSW
- CCDB2 / DBA Technical Consultant - Finance company - Melbourne CBD - DB2VIC
- FTSenior .Net Developer - Mobility/Portal SolutionsNSW
- CCDigital Business Analyst - Agile/ScrumNSW
- FTDigital Account ManagerNSW
- FTTechnical Operations ManagerNSW
- FTSupport Consultant - Global Vendor - $55-75,000NSW
- FTDigital Account ManagerNSW
iAsset is a channel management ecosystem that automates all major aspects of the entire sales,marketing and service process, including data tracking, integrated learning, knowledge management and product lifecycle management.
Aberdeen Group: Building Business Resilience Through Active Archive
One of the key data management challenges organizations often face is how to keep their archived data accessible and active, without spending the time and resources associated with primary storage. The amount of data in the archives can range from one half to 10 times the amount of data actively managed in primary storage. How can end-users gain access to historical files in a reasonable amount of time without pulling IT employees from higher priority projects? Aberdeen's research found the answer in the technologies and processes that comprise active archiving.
HiveManager Online: Less Dollars, More Sense
Today’s de facto standard controller-based Wi-Fi infrastructure model is just too complicated, too expensive, and too unreliable. It’s common for enterprise and mid-market network operators alike to get caught in a crossroads of compromises involving costs, complexity, features, and reliability.








