This is a slightly shortened version of the full Proposal, iniially posted on my user page on MediaWiki.org, and then in final form on the GSoC app site.
Title: | General RDF export/import in Semantic MediaWiki |
Student: | Samuel Lampa |
Abstract: | Rework and extend the import/export functionality of Semantic MediaWiki (SMW) to allow also full, general RDF import. One motivation and use case is to use SMW as a collaborative RDF editor integrated with workflow / workbench software like Bioclipse, for workflows of the type: Import RDF --> Collaboratively edit --> Export in same format The project will include general reworkings of the import/export functionality though, which that specific use case can take advantage of. |
Content: | Identity
Name: Samuel Lampa Contact/working infoTimezone: Sweden (GMT +1) Project summaryExtend the import/export functionality of Semantic MediaWiki (SMW) to allow also full, general RDF import. The background for the idea (for me) is to enable the use of SMW as a general collaborative RDF editor that can be integrated with workflow systems/scriptable workbench software such as Bioclipse, to enable workflows of the type: Import RDF to Wiki --> Collaboratively edit --> Export back in same format ...but this project would include general reworkings of the import/export functionality, which that specific use case can take advantage of. The ideas for a practical approach (based on mail conversation with Denny Vrandecic, who has declared interest to mentor the project), is briefly presented in the deliverables section. The main idea is to replace the defunct RAP with ARC as RDF library (has been discussed on SMW-devel already), and make use of the SMWWriter API, to create a general SPARQL API, with update functionality (Preferrably based on the "SPARQL Update" standard). This would form the technical basis for implementing general RDF export/import functionality. What also needs to be added is functionality to map RDF URI:s to wiki titles. Two approaches have been discussed for that (probably both can be used, depending on use case):
About youI'm a 27 year old biotechnology student att Uppsala university, having much interest in systems biology, computational biology, system design, semantic technologies and web development, currently just finishing my M.Sc. degree in biotechnology (focusing on systems biology and bioinformatics). Much of my technical experience comes from besides my studies, from doing web design since 10+ years, web development with Drupal and MediaWiki for some 4 years, as well as summer work as PC support technician/(Windows) network admin etc. Web development has been done through RIL Partner AB where we are also providing web hosting for a few customers, running our own dedicated servers, optimized for MediaWIki and Drupal, which I'm administrating. At RIL Partner we've been playing around quite a bit with MediaWiki and Semantic MediaWiki, testing out different ideas. In the last few years I've actively focused on getting more hands on coding experience, and hence did a PHP/MediaWiki web interface project at uni, took bioinformatics courses, did a little Java web crawler for use with the Sphinx search engine etc. In my degree project, I'm getting experience from Java coding, Eclipse RCP development and Prolog, as well as getting to know the W3C Semantic formats and technologies. The borders between studies, work and hobby tends to get a bit blurred for me (I'm typically easier to reach by e-mail or skype than by phone :) ). not leaving very much spare time. The time that is over anyway I typically spend hanging out with my family. In the near future I hope to be able to work in the bioinformatics sector, or with systems and knowledge management tools for the Life Sciences. I'll probably continue open source development for Bioclipse and MediaWiki to some extent in the future, as I see both of them as great platforms for the kind of functionality I want to implement and work with. The above proposed GSoC project is highly interesting to me as it would be a killer feature for Bioclipse to be able to export data for community collaboration, and then retrieve it back again. What drives me is a vision to enable better and more systematic knowledge discovery and integration in the Biology / Life Sciences domain, by integrating Semantic technologies, with computational and simulation tools. DeliverablesRequired deliverablesThe order of 1) and 2) does not matter.
During the whole time: document (On blog: http://saml.rilspace.com), support, release. If time permits
Project schedule... ParticipationI prefer having contact daily (or so) on a chat such as IRC or Skype (hanging out daily at Freenode/#bioclipse right now for my degree project) + E-mail for longer discussions. I also much like the idea to use a blog (and really use it!) to document my progress (and make sure I don't forget things learned), and to use GitHub (or similar) for publishing source code. Past open source experience
Any other info
See also |
Created on: | 2010-03-31 14:14:45.188078 |
Last Modified on: | 2010-04-02 01:48:22.189994 |