Kolloquium des Lehrstuhls für Systeme der Informationsverwaltung anlässlich des 80. Geburtstags von Herrn Prof. em. Dr.-Ing. Dr. h.c. Lockemann

Prof. em. Dr.-Ing. Dr. h.c. Peter Lockemann

Der Lehrstuhl für Systeme der Informationsverwaltung des Karlsruhe Instituts für Technologie (KIT) freut sich, Ihnen anläßlich des 80. Geburtstags seines ehemaligen Leiters, Herrn Prof. Lockemann, dieses Programm eines wissenschaftlichen Kolloquiums präsentieren zu dürfen. Die Vortragenden sind ehemalige und derzeitige Mitarbeiter des Lehrstuhls, die über einen Ausschnitt ihrer aktuellen Forschungsaktivitäten berichten. Zu dem Kolloquium sind derzeitige und ehemalige Mitarbeiter des Lehrstuhls und der von Herrn Prof. Lockemann geleiteten Gruppe am FZI, weitere interessierte KIT- und FZI-Angehörige,  Persönlichkeiten aus der hiesigen IT-Industrie und alle Herrn Prof. Lockemann persönlich verbundenen Personen eingeladen.

Klemens Böhm

 

 

 

 

 
Date Friday, 20.11.2015
Location Room -101, Informatics Building 50.34 (Directions or googlemaps)

 

 

Program

14:00   Welcome

 
14:15  Efforts and Challenges in Arabic Text Mining and Information Retrieval  

Text mining and information retrieval of Arabic content poses a great scientific challenge due to three main aspects. The nature of the Arabic language makes it impossible to directly apply the techniques and algorithms used in Western languages. The language lacks the availability of large tagged corpora. Despite the tremendous increase of Arabic content in the Internet, existing corpora are small and are limited to a set of specific domains. Last but not least, the community of researchers working on this field is relatively small as compared to Western languages. This results in the sparseness of algorithms to build upon and enhance. We present our contribution to this localized area of information retrieval, specifically in creating facets hierarchies for unstructured Arabic documents, sentiment analysis of colloquial Arabic tweets, and relation extraction using distant supervision methods.

 

Speaker Bio:
1973: Born in Karlsruhe.

1989: General Certificate of Education, UK.

1994: Bachelor of Science, Computer Science Department, Faculty of Engineering, Alexandria University.

1997: Master of Science. Computer Science Department, Faculty of Engineering, Alexandria University.

1998: Aufbaustudium Informatik an der Fakultät für Informatik, Universität Karlsruhe.

2001: Doctor of Engineering, IPD, Universität Karlsruhe. Thesis title: Transactional Agents: Towards a Robust Multi-Agent System.

since 2002: Lecturer at the Dept. of Computer and Systems Engineering, Faculty of Engineering, Alexandria University, Egypt.

2005: Co-founder of POET Egypt, LLC. A sister company of POET GmbH in Karlsruhe. POET Egypt, LLC is the nearshore development partner to POET GmbH and other software companies in the region of BW.

since 2009: Associate Professor at the Dept. of Computer and Systems Engineering, Faculty of Engineering, Alexandria University, Egypt.

 

Prof. Dr.
Khaled Nagi

14:45  Coffee Break  
   
15:15  Lifecycle Support for Scientific Data  

Data plays an increasingly important role in scientific progress. Thus, proper management of this valuable resource along its entire lifecycle is crucial:  In this talk, we will look at different stages of the lifecycle from experiment planning and data acquisition, to discovery, integration, analysis and publication, and discuss challenges and solutions.

 

Speaker Bio:

Birgitta König-Ries holds the Heinz-Nixdorf Chair for Distributed Information Systems at Friedrich-Schiller-Universität Jena. Before joining FSU, she was working with the TU Munich, Universität Karlsruhe (this is where she obtained both her PhD and her diploma degrees), Florida International University, and the University of Louisiana at Lafayette. Her main research interest is on different aspects of information integration. In the last few years, her group has worked on this topic mostly in the context of scientific data management.

 

Prof. Dr. Birgitta König-Ries

15:45  HyPer: one DBMS for all - High-Performance OLTP AND OLAP on Brawny Servers AND Wimpy Devices  

Ever increasing main memory capacities and processors with multiple cores have fostered the development of database systems that process and store data solely in main memory. This talk presents HyPer, a high-performance hybrid OLTP&OLAP main memory database system we are building at TUM. Unlike other main memory database systems, HyPer aims at providing highest performance for both, OLTP AND OLAP workloads on brawny AND wimpy systems. OLAP query processing is separated from mission-critical OLTP transaction processing using an efficient virtual memory (VM) snapshotting mechanism. Platform-independent high-performance OLTP and OLAP is achieved by efficiently compiling transactions and queries into efficient target machine code. Even though the SQL-92 standard, a PL/SQL-like scripting language, and ACID-compliant transactions are supported, HyPer has a memory footprint of just a few megabytes. This talk highlights the following recent research efforts in the HyPer project: Snapshotting a transactional database, indexing, parallel multi-core query evaluation, multi-version concurrency control.

This is joint work with my colleague Prof. Thomas Neumann and the HyPer team: www.hyper-db.de


Speaker Bio:

Alfons Kemper's research field is database systems engineering. He explores ways to optimize information systems for operational and scientific applications as a way to combat the data explosion. His main areas of interest are optimization concepts for distributed information structures, data integration methods and, in particular, main memory-based database systems. Together with his colleague Thomas Neumann he leads the HyPer main-memory database project (hyper-db.com)  at Technische Universität München. HyPer is one of the first hybrid database systems that offers high-performance OLTP as well as OLAP in parallel on the same database state.  

After studying computer science at the University of Dortmund from 1977 to 1980, he moved to the University of Southern California, Los Angeles. While there, he obtained his Master of Science and doctorate. Upon his return to Germany, he completed his lecturer qualification at the University of Karlsruhe. His first professorship was conferred by RWTH Aachen. After many years as Director of the Chair of Database Systems at the University of Passau, TUM offered him a position in 2004. From 2006 to 2010, he was Dean of the Department of Informatics at TUM. His textbook on database systems, published by deGruyter and now in its 10th edition, is a best-seller in German-speaking countries and is used in most universities and colleges.
 

Prof. Dr. Alfons Kemper


 

 

16:15 Coffee Break
 

 

16:45  Energy Status Data - Informatics Methods for its Collection, Analysis and Exploitation

Designing future energy system is a fundamentally important concern of our society. These systems must be able to cope with fluctuating supply of renewable energy as well as with flexible demand, including the one of big systems such as industrial plants. To organize the supply of locally generated energy in a better manner, it is necessary to collect, analyze and utilize so-called energy-status data in a structured fashion. Energy-status data describe all aspects of energy systems, be it measured/metered values, be it derived values such as the degree of aging of batteries. The German National Science Foundation (DFG) has recently granted a research training program "Energy Status Data – Informatics Methods for its Collection, Analysis, and Exploitation", addressing exactly these issues. This talk will present its objectives as well as results of some early work by its organizers.

 

Speaker Bio:

Since 2004, Klemens Böhm holds the chair of databases and information systems at Karlsruhe Institute of Technology (KIT). Prior to that, he has been professor at Otto-von-Guericke-Universität Magdeburg, senior researcher at the Swiss Federal Institute of Technology Zurich (ETH), and researcher at GMD in Darmstadt. He has completed his PhD at Technical University of Darmstadt in 1997. Current research topics of his chair include big data analytics, data privacy and workflow management. Klemens is speaker of the newly established DFG research training program (Graduiertenkolleg) "Energy Status Data -- Informatics Methods for its Collection, Analysis and Exploitation" at KIT. He gives much attention to collaborations with scientists from other disciplines and with industry.

Prof Dr. Klemens Böhm

 

 

 

 

17:15  Ex Astris Scientia - Identifying Hot Spots of User Interest within an Astronomy Database

 

Many scientific databases nowadays are publicly available for querying and advanced data analytics. A well-known example is the Sloan Digital Sky Survey, named SkyServer offering data to astronomers and the general public. For such data, it is important to understand the public focus and trending research directions on the subject described by the database. Due to the large number of users, it is a non-trivial task to determine what the users are really interested in and whether the database is used as intended. In a current line of research, we aim at answering this question efficiently by analyzing SQL log data. We define user interest based on access areas. To our knowledge, the concept of access areas and their extraction from SQL log data has not been proposed before. They offer a unified representation of user interest and can be easily extracted from SQL log data. Moreover, they are independent of any specific database state and are computed solely on the log and thus without imposing any additional load on the database itself. Finally, they can be extracted from anonymized SQL-log data, and hence do not impose a privacy risk in case of confidential data. A clustering of the SkyServer access areas reveals hot spots within the data space that are interesting for many astronomers for "some particular" reason. In most cases, these clusters only refer to a small part of the data space (less than one percent), and thus contain important information for tuning opportunities. Another interesting result is that some clusters refer to non-existing data, and hence mark missing data that would be of interest.

Speaker Bio:

Martin Schäler was born in 1985 in Havelberg (Germany). He received his Master degree from the Otto-von-Guericke University Magdeburg in 2010. Afterwards he was employed as a research assistant and scientific coordinator at the Otto-von-Guericke University Magdeburg receiving his Ph.D. degree in 2014. Since August 2015 he is a post-doctoral researcher at the DBIS working group (Lehrstuhl für Systeme der Informationsverwaltung) of Prof. Böhm at the Karlsruhe Institute of Technology. His research interests include Multi-Dimensional Access Methods, Hardware-sensitive Database Tuning, and Provenance.

Dr. Martin Schäler

 

17:45 Discussion und Closing

 

19:00 Social Event at Restaurant Aurum (only with registration)