Master student projects
The MonetDB experimentation platform creates a challenging environment
to conduct research and development. The student projects range from
development tasks for broader dissemination to
projects geared at preparing for a PhD track.
Support for Scientific Data(ca 3 months)
Predominant scientific formats are HDF5 and NETCDF.
They are produced by scientific experiments, simulations
and consumed by GUIs. The target of this project is
to extend the MonetDB kernel with a flexible scheme
to make the content of these files accessible at the MAL
and SQL level. It would greatly expand the applicability
of MonetDB's user base.
MonetDB/tellme (6 months)
Databased query languages presuppose the user to be aware of
the database content, both its schema and value distribution.
A grand challenge is to squeeze SQL result sets into a summary
to aid the user in formulation of better queries.
(Java project)
MonetDB/Viewport (3-6 months)
The amount of numeric data stored in scientific- and commercial-
database applications make searching it in an ad-hoc way
with SQL rather cumbersome. For, the data distribution and
correlations are often not known. A possible solution is to
map the data into a visual world. Querying then becomes navigation
in a virtual world. The target of this project is to explore
a few visual datamining approaches on top of MonetDB.
The target database is our skyserver implementation.
(Java project)
MonetDB/SQLARRAY (ca 6 months)
The MonetDB/SQL database technology has high potentials
in the area of scientific data management. However,
the existing query language interfaces lack the abstractions
needed, e.g. array support in SQL is still immature.
The goal of this project is to identify enhancements to SQL
to simplify the step for a scientist to use a DBMS.
It is compared with the opportunities offered by
a more functional-style language.
(C project)
MonetDB/SPARQL (ca 6 months).
In this project we seek
to provide a complete implementation of SPARQL,
the embeding of RDF into SQL, on top of the MonetDB
framework. We expect the kernel to be well-prepared
to handle the large graph-based database querying
in this application domain.
(C project)
MonetDB Embedded (ca 6 months)
Embedded databases are widely used. Many small devices rely on hundreds
of calls to organise user-specific behaviour.
The challenge of this project is to realise a front-end
for MonetDB embedded to study its performance consequences, e.g.
compare it with Berkeley DB. The next step would be to improve
the footprint of the code base, aiming for a
version that could run on such small devices as
a Gumstix
part of applicances and robots.
(C project)
MonetDB/LDAP (ca 6 months)
With emerging roaming of users, the need for a central maintenance and
storage of user preferences increases. This kind of information is
typically stored in a directory server, using the LDAP protocol. This
project aims at creating such directory server based on MonetDB, to
allow for very flexible addition or modification of properties in large
environments, while still maintaining a high performance. The project
can target as implementation within OpenLDAP as backend (via an MonetDB
Embedded instance) or as stand alone application where MonetDB handles
the LDAP client calls.
MonetDB SQL:2003 (ca 3 months)
The MonetDB SQL front-end is based on the 1999 standard
and the SQL:2003 standard is approved.
This development project is aimed at covering more
functionality ground, e.g. the datamining functionality.
MonetDB in GIS applications (ca 6 months)
Geographical Information Systems challenge a database system in
the support of both geometric data structures, algorithms,
and management of large binary objects, e.g. satellite images.
This project should scout the available approaches and extend
MonetDB with functionality to become a viable alternative in this field.
MonetDB in Bio Informatics (ca 6 months)
Data-mining and searching in bio-medical information systems
is an emerging field currently hardly supported by standard relational systems.
In this project, the functionality of MonetDB should be extended to
become a viable engine for scientific database, illustrated by
an application in DNA sequence retrieval.
MonetDB Quality Assurance (ca 3 months)
The MonetDB project is a large sized adventure. Many hardware
platforms, several sub-projects and a fair amount of code pose a severe
challenge to its developers.
Every night the system is tested on available platforms, and reports are
sent out to the developers. However, this is not enough to avoid
(compatibility) problems or solve them. Let alone to prevent from
happening. The project needs some tools to help in QA improvements and
better communication of (recently introduced) problems.
MonetDB Stories (ca 3 months, journalists)
Publications on MonetDB functionality targeted at a larger
audiance are needed.
MonetDB promotional applications
MonetDB finds its way in many areas outside our direct view. Feedback and
end-user experiences are therefore hard to get. Students working in
close contact with industry who would like to port their existing applications,
on a trial basis, to the MonetDB platform are invited to contact us.
PhD thesis topics
The MonetDB experimentation platform provides a sound basis to explore
many areas of database kernel research. Its open-source and well-documented
code base permits exploration of novel data structures, algorithms and
optimization techniques geared at exploring the newest hardware developments.
The MonetDB product family road-map provides ample opportunities to
work at the frontier of database research and be able to demonstrate
your innovation.
The algebraic interface over a binary storage model has been proven
pivotal in developing alternative (query) languages on top of MonetDB.
Areas of particular interest are query language support for
(multimedia) information retrieval, which call for a completely different
(probabilistic) evaluation model.
Likewise, the system could be extended in the area of (time) series,
P2P-systems, multi-core systems and streaming applications.
Keep us in formed about your PhD activities.
|