Web Business Intelligence Research Articles

The World Wide Web is an increasingly important data source for business decision making; however, extracting information from the Web remains one of the challenging issues related to Web business intelligence applications. To use heterogeneous Web data for decision making, documents containing relevant data must be located, and the data of interest within the documents must be identified and extracted. Currently, most automatic information extraction systems can only cope with a limited set of document formats or do not adapt well to changes in document structure, as a result, many real-world data sources with complex document structures cannot be consistently interpreted using a single information extraction system. This paper presents an adaptive information extraction system prototype that combines multiple information extraction approaches to allow more accurate and resilient data extraction for a wide variety of Web sources. The Amorphic Web information extraction system prototype can locate data of interest based on domain knowledge or page structure, can automatically generate a wrapper for a data source, and can detect when the structure of a Web-based resource has changed and act on this to search the updated resource to locate the desired data. The prototype Amorphic information extraction system demonstrated improved information extraction accuracy for the four different extraction scenarios examined when compared with traditional data extraction approaches

Read full abstract

It is estimated that over seven billion static pages exist in the Web today, and backend databases can potentially produce at least three times as many dynamic pages. However, the best search engines index only approximately 20% of the static pages. So the real question is: While the Web is certainly the most amazing and comprehensive information source ever created, are you really getting all the information you need for your specific purpose? The answer to this question is mostly “yes” for the individual user, who uses the Web as an information source for casual purposes. However, for an individual who uses the Web as an essential and comprehensive source of information—for business or research—the answer is quite the opposite. Even a sophisticated Web user requires a significant amount of time and effort to find all of the information needed for a given task. In this paper the concept of Web Business Intelligence (WBI) is introduced, an emerging class of software that leverages the unprecedented content on the Web to extract actionable knowledge in an organizational setting. The contributions include an architecture for WBI, a survey of technologies relevant to the various components of the architecture, and illustration of the value of WBI by means of a detailed example from the e-finance domain. This article concludes with a discussion on the future of WBI.

Read full abstract

Web Business Intelligence Research Articles

Articles published on Web Business Intelligence

Exploiting the Information Web

Business intelligence for new market development: a web semantic network analysis approach

Links to commercial websites as a source of business information

Web Business Intelligence: Mining the Web for Actionable Knowledge

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Web Business Intelligence Research Articles

Articles published on Web Business Intelligence

Exploiting the Information Web

Business intelligence for new market development: a web semantic network analysis approach

Links to commercial websites as a source of business information

Web Business Intelligence: Mining the Web for Actionable Knowledge