Data and knowledge base research at Hong Kong University of Science and Technology

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

The National Technical University of Athens (NTUA) is the leading Technical University in Greece. The Computer Science Division of the Electrical and Computer Engineering Department covers several fields of practical, theoretical and technical computer science and is involved in several research projects supported by the EEC, the government and industrial companies. The Knowledge and Data Base Systems (KDBS) Laboratory was established in 1992 at the National Technical University of Athens. It is recognised internationally, evidenced by its participation as a central node in the Esprit Network of Excellence IDOMENEUS. The Information and Data on Open MEdia for NEtworks of USers, project aims to coordinate and improve European efforts in the development of next-generation information environments which will be capable of maintaining and communicating a largely extended class of information in an open set of media. The KDBS Laboratory employs one full-time research engineer and several graduate students. Its infrastructure includes a LAN with several DECstation 5000/200 and 5000/240 workstations, an HP Multimedia Workstation, several PCs and software for database and multimedia applications. The basic research interests of our Laboratory include: Spatial Database Systems, Multimedia Database Systems and Active Database Systems . Apart from the above database areas, interests of the KDBS Laboratory span several areas of Information Systems, such as Software Engineering Databases, Transactional Systems, Image Databases, Conceptual Modeling, Information System Development, Temporal Databases, Advanced Query Processing and Optimization Techniques. The group's efforts on Spatial Database Systems, include the study of new data structures, storage techniques, retrieval mechanisms and user interfaces for large geographic data bases. In particular, we look at specialized, spatial data structures (R-Trees and their variations) which allow for the direct access of the data based on their spatial properties, and not some sort of encoded representation of the objects' coordinates. We study implementation and optimization techniques of spatial data structures and develop models that make performance estimation. Finally, we are investigating techniques for the efficient representation of relationships and reasoning in space. The activities on Multimedia Database Systems, include the study of advanced data models, storage techniques, retrieval mechanisms and user interfaces for large multimedia data bases. The data models under study include the object-oriented model and the relational model with appropriate extensions to support multimedia data. We are also investigating content-based search techniques for image data bases. In a different direction, we are studying issues involved in the development of multimedia front-ends for conventional, relational data base systems. In the area of Active Database Systems, we are developing new mechanisms for implementing triggers in relational databases. Among the issues involved, we address the problem of efficiently finding qualifying rules against updates in large sets of triggers. This problem is especially critical in database system implementations of triggers, where large amounts of data may have to be searched in order to find out if a particular trigger may qualify to run or not. Continuing work that started at the Foundation for Research and Technology (FORTH), Institute of Computer Science, the group is investigating reuse-oriented approaches to information systems application development. The approaches are based on a repository that has been implemented at FORTH as a special purpose object store, with emphasis on multimodal and fast retrieval. Issues of relating and describing software artifacts (designs, code, etc.) are among the topics under investigation. A new important research direction of the group is on Data Warehouses, which are seen as collections of materialized views captured over a period of time from a heterogeneous distributed information system. Issues such as consistent updates, data warehouse evolution, view reconciliation and data quality are being investigated. Research in Image Databases deals with the retrieval by image content, that uses techniques from the area of Image Processing. We are currently at early stage in this direction, having collected many segmentation and edge detection algorithms, which will be used and evaluated in images of various contents. Our work on Advanced Query Processing and Optimization Techniques includes dynamic or parametric query optimization techniques. In most database systems, the values of many important runtime parameters of the system, the data, or the query are unknown at query optimization time. Dynamic, or parametric, query optimization attempts to identify several execution plans, each one of which is optimal for a subset of all possible values of the run time parameters. In the next sections we present in detail our research efforts on the three main research areas of the KDBS Laboratory: Spatial, Multimedia and Active Databases.

Similar Papers
  • Research Article
  • Cite Count Icon 4
  • 10.1016/0950-5849(95)98924-5
Query evaluation and optimization in deductive and object-oriented spatial databases
  • Jan 1, 1995
  • Information and Software Technology
  • Wei Lu + 1 more

Query evaluation and optimization in deductive and object-oriented spatial databases

  • Conference Article
  • Cite Count Icon 1
  • 10.1109/iccsit.2010.5564420
A query optimization trategy for implementing multi dimensional model in spatial database system
  • Jul 1, 2010
  • Animesh Tripathy + 2 more

Spatial Data Warehouses (SDWs) combine spatial databases (SDBs) and data warehouses (DWs) allowing analysis of historical data. This data can be queried using Spatial On-Line Analytical Processing (SOLAP). SDW and SOLAP systems are emerging areas that raise several research issues. In this paper, we refer to a different problem existing in SDWs that motivated us to propose a framework - a conceptual multidimensional model able to express users' requirements for SDW and SOLAP applications. We present a different research direction that is important to consider providing satisfactory solutions for SDW and for SOLAP systems. That important area is spatial query optimization. For the past several years, the research on spatial database systems has actively progressed because the applications using the spatial information such as geographic information systems (GIS), computer aided design (CAD) and multimedia systems have increased. However, most of the research has dealt with only a part of spatial database systems such as data models, spatial indexes, spatial join algorithms, or cost models. There has been a little research on the spatial query optimization which can integrate them. Most of the spatial query optimization techniques published until now has not properly reflected the characteristics of the SDBs. This paper presents query optimization strategies which take the characteristics of SDBs into account. The application of standard query processing and optimization techniques in the context of an integrated SDB environment is discussed.

  • Book Chapter
  • Cite Count Icon 4
  • 10.1007/978-981-15-0222-4_30
Big Data and Query Optimization Techniques
  • Jan 1, 2020
  • Aarti Chugh + 2 more

With rise in new technologies like Big Data, Internet of Things, Sensor Networks, etc. new query processing and optimization techniques have been evolved which handles the complex and interactive queries. Traditional query processing systems are not capable enough to handle queries on big data. In this paper, we represent many of the big data querying techniques along with the comparative analysis. Finally, the paper concludes with list of tools/techniques that are used for big data query optimization.

  • Research Article
  • Cite Count Icon 4
  • 10.1016/s0164-1212(02)00215-2
An efficient query optimization strategy for spatio-temporal queries in video databases
  • Dec 5, 2003
  • Journal of Systems and Software
  • Gülay Ünel + 3 more

An efficient query optimization strategy for spatio-temporal queries in video databases

  • Research Article
  • Cite Count Icon 3
  • 10.1016/s0164-1212(03)00215-2
An efficient query optimization strategy for spatio-temporal queries in video databases
  • Dec 5, 2003
  • The Journal of Systems & Software
  • G Ünel + 3 more

An efficient query optimization strategy for spatio-temporal queries in video databases

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/0-387-25229-0_2
Query Execution and Optimization
  • Jan 1, 2005
  • Stratis D Viglas

Query execution and optimization for streaming data revisits almost all aspects of query execution and optimization over traditional, disk-bound database systems. The reason is that two fundamental assumptions of disk-bound systems are dropped: (i) the data resides on disk, and (ii) the data is finite. As such, new evaluation algorithms and new optimization metrics need to be devised. The approaches can be broadly classified into two categories. First, there are static approaches that follow the traditional optimize-then-execute paradigm by assuming that optimization-time assumptions will continue to hold during execution; the environment is expected to be relatively static in that respect. Alternatively, there are adaptive approaches that assume the environment is completely dynamic and highly unpredictable. In this chapter we explore both approaches and present novel query optimization and evaluation techniques for queries over streaming sources.

  • Research Article
  • 10.1016/0020-0255(90)90024-5
Distributed query processing and optimization techniques for a hierarchically structured computer network
  • Jul 1, 1990
  • Information Sciences
  • Mingsen Guo + 2 more

Distributed query processing and optimization techniques for a hierarchically structured computer network

  • Research Article
  • Cite Count Icon 3
  • 10.1016/j.protcy.2012.05.125
New Database Architecture for Smart Query Handler of Spatial Database
  • Jan 1, 2012
  • Procedia Technology
  • Parthasarathi Boyal + 1 more

New Database Architecture for Smart Query Handler of Spatial Database

  • Conference Article
  • Cite Count Icon 40
  • 10.1109/icde.2002.994788
Decoupled query optimization for federated database systems
  • Aug 7, 2002
  • A Deshpande + 1 more

We study the problem of query optimization in federated relational database systems. The nature of federated databases explicitly decouples many aspects of the optimization process, often making it imperative for the optimizer to consult underlying data sources while doing cost-based optimization. This not only increases the cost of optimization, but also changes the trade-offs involved in the optimization process significantly. The dominant cost in the decoupled optimization process is the "cost of costing" that traditionally has been considered insignificant. The optimizer can only afford a few rounds of messages to the underlying data sources and hence the optimization techniques in this environment must be geared toward gathering all the required cost information with minimal communication. In this paper, we explore the design space for a query optimizer in this environment and demonstrate the need for decoupling various aspects of the optimization process. We present minimum-communication decoupled variants of various query optimization techniques, and discuss tradeoffs in their performance in this scenario. We have implemented these techniques in the Cohera federated database system and our experimental results, somewhat surprisingly, indicate that a simple two-phase optimization scheme performs fairly well as long as the physical database design is known to the optimizer, though more aggressive algorithms are required otherwise.

  • Conference Article
  • Cite Count Icon 1
  • 10.1109/icisc.2018.8399085
Advanced query processing and its optimization for mobile computing environment
  • Jan 1, 2018
  • A.P Chendke + 2 more

Query processing is that method or technique in area of mobile atmosphere which deal using mobile environment. This research in that way user can think advancement using query processing and it takes more use for speedily accessing global query processing and it is joint venture of query processing between dissimilar sites includes fixed server movable computer. The necessity saving of energy and also the existence of lopsided features in a moveable computing atmosphere, the predictable query processing for a scattered database cannot be straightly genuine to a moveable computing organization. The mobile environment is a collection of mobile diverse hosts, which are enabled to communicate using wireless links. These wireless links are change giving to the natures of mobile networks. Moreover, nodes in the ad-hoc network have to link without any centralized or help. The usability for the user also changes using query processing tool. Thus, this mechanism that allows the sharing of functionality among different devices in same environment change the way of user communication for searching fast query time in mobile computing environment. The aim of this research is to make innovation in query optimization technique in mobile computing environment. This research focuses on improvement of various query optimization techniques for highly effective and trustworthy query optimization methods.

  • Book Chapter
  • Cite Count Icon 6
  • 10.1007/978-3-540-24709-8_113
Extension of Geography Markup Language (GML) for Mobile and Location-Based Applications
  • Jan 1, 2004
  • Young Soo Ahn + 3 more

Geography Markup Language (GML) has been extended for mobile and location-based applications. Three schemas are included in the extension, i.e., voice schema, tracking schema, and POI (Point Of Interest) schema. In order to handle effectively the extension, we also have designed a Spatial XQuery language and its processing modules. Because our work is based on a spatial database system, the Spatial XQuery statements are first translated into SQL statements. By working on an existing spatial database system, we can use of the existing amenities of database systems such as query optimization, spatial indexes, concurrency control, and crash recovery. Translation of the Spatial XQuery into SQL has been explained using examples. Because the results from the spatial database system are in the form of tables, we need to translate again the results into XML statements. A working example of the proposed system as an Emergency Support System is also presented. Prospected application areas of the proposed system are almost all mobile and location-based systems such as LBS (Location-based System), POI systems, tracking systems, ubiquitous systems, and distributed control and management systems.

  • Conference Article
  • Cite Count Icon 30
  • 10.1109/icde.1995.380406
Semantic query optimization for methods in object-oriented database systems
  • Mar 6, 1995
  • K Aberer + 1 more

S.70-79

  • Book Chapter
  • Cite Count Icon 1
  • 10.1007/978-3-030-12981-1_42
How Good is Query Optimizer in Spark?
  • Jan 1, 2019
  • Zujie Ren + 6 more

In the big data community, Spark plays an important role and is used to process interactive queries. Spark employs a query optimizer, called Catalyst, to interpret SQL queries to optimized query execution plans. Catalyst contains a number of optimization rules and supports cost-based optimization. Although query optimization techniques have been well studied in the field of relational database systems, the effectiveness of Catalyst in Spark is still unclear. In this paper, we investigated the effectiveness of rule-based and cost-based optimization in Catalyst, meanwhile, we obtained a set of comparative experiments by varying the data volume and the number of nodes. It is found that even when applied query optimizations, the execution time of most TPC-H queries were slightly reduced. Some interesting observations were made on Catalyst, which can enable the community to have a better understanding and improvement of the query optimizer in Spark.

  • Conference Article
  • Cite Count Icon 2
  • 10.1109/icde.2005.61
Energy-Efficient Data Organization and Query Processing in Sensor Networks
  • Apr 19, 2005
  • R Gummadi + 4 more

Recent sensor networks research has produced a class of data storage and query processing techniques called Data-Centric Storage that leverages locality-preserving distributed indexes like DIM, DIFS, and GHT to efficiently answer multi-dimensional range and rangeaggregate queries. These distributed indexes offer a rich design space of a) logical decompositions of sensor relation schema into indexes, as well as b) physical mappings of these indexes onto sensors. In this poster, we explore this space for energy-efficient data organizations (logical and physical mappings of tuples and attributes to sensor nodes) and devise purely local query optimization techniques for processing queries that span such decomposed relations. We propose four design techniques: (a) fully decomposing the base sensor relation into distinct sub-relations, (b) spatially partitioning these sub-relations across the sensornet, (c) localized query planning and optimization to find fully decentralized optimal join orders, and (d) locally caching join results. Together, these optimizations reduce the overall network energy consumption by 4 times or more when compared against the standard single multidimensional distributed index on a variety of synthetic query workloads simulated over both synthetic and real-world datasets. We validate the feasibility of our approach by implementing a functional prototype of our data organizer and query processor on Mica2 motes and observing comparable message cost savings.

  • Conference Article
  • Cite Count Icon 1
  • 10.1145/1031495.1031532
Energy-efficient data organization and query processing in sensor networks
  • Nov 3, 2004
  • Ramakrishna Gummadi + 4 more

Recent sensor networks research has produced a class of data storage and query processing techniques called Data-Centric Storage that leverages locality-preserving distributed indexes like DIM, DIFS, and GHT to efficiently answer multi-dimensional range and range-aggregate queries. These distributed indexes offer a rich design space of a) logical decompositions of sensor relation schema into indexes, as well as b) physical mappings of these indexes onto sensors. In this poster, we explore this space for energy-efficient data organizations (logical and physical mappings of tuples and attributes to sensor nodes) and devise purely local query optimization techniques for processing queries that span such decomposed relations. We propose four design techniques: (a) fully decomposing the base sensor relation into distinct sub-relations, (b) spatially partitioning these sub-relations across the sensornet, (c) localized query planning and optimization to find fully decentralized optimal join orders, and (d) locally caching join results. Together, these optimizations reduce the overall network energy consumption by 4 times or more when compared against the standard single multi-dimensional distributed index on a variety of synthetic query workloads simulated over both synthetic and real-world datasets. We validate the feasibility of our approach by implementing a functional prototype of our data organizer and query processor on Mica2 motes and observing comparable message cost savings.

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.