News and Updates on the KRR Group
Header image

Source: Semantic Web world for you
The VU is making short videos of 1 minute to highlight some of the research that is being done within its walls. This is the video for SemanticXO, realised by Pepijn Borgwat and presented by Laurens Rietveld. The script is in Dutch and is as follows: Ik ben laurens rietveld en ik doe onderzoek aan de vrije [...]

TabLinker is experimental software for converting manually annotated Microsoft Excel workbooks to the RDF Data Cube vocabulary. It is used in the context of the Data2Semantics project to investigate the use of Linked Data for humanities research (Dutch census dataproduced by DANS).

TabLinker was designed for converting Excel or CSV files to RDF (triplification, RDF-izing) that have a complex layout and cannot be handled by fully automatic csv2rdf scripts.

A presentation about Linked Census Data, including TabLinker is available from SlideShare.

Please consult the Github page for the latest release information.

Using TabLinker

TabLinker takes annotated Excel files (found using the srcMask option in the config.ini file) and converts them to RDF. This RDF is serialized to the target folder specified using the targetFolder option in config.ini.

Annotations in the Excel file should be done using the built-in style functionality of Excel (you can specify these by hand). TabLinker currently recognises seven styles:

  • TabLink Title - The cell containing the title of a sheet
  • TabLink Data - A cell that contains data, e.g. a number for the population size
  • TabLink ColHeader - Used for the headers of columns
  • TabLink RowHeader - Used for row headers
  • TabLink HierarchicalRowHeader - Used for multi-column row headers with subsumption/taxonomic relations between the values of the columns
  • TabLink Property - Typically used for the header cells directly above RowHeader or HierarchicalRowHeader cells, cell values are the properties that relate Data cells to RowHeader and HierarchicalRowHeader cells.
  • TabLink Label - Used for cells that contain a label for one of the HierarchicalRowHeader cells.

An eight style, TabLink Metadata, is currently ignored (See #3).

An example of such an annotated Excel file is provided in the input directory. There are ways to import the styles defined in that file into your own Excel files.

Tip: If your table contains totals for HierarchicalRowHeader cell values, use a non-TabLink style to mark the cells between the level to which the total belongs, and the cell that contains the name of the total. Have a look at the example annotated Excel file to see how this is done (up to row 428).

Once you’re all set, start the TabLinker by cd-ing to the src folder, and running:

python tablinker.py

Requirements

TabLinker was developed under the following environment:

Source: Think Links

Update: A version of this post appeared in SURF magazine (on the back page) in their trendwatching column

Technology at its best lets us do what we want to do without being held back by time consuming or complex processes. We see this in great consumer technology: your phone giving you directions to the nearest cafe, your calendar reminding you of a friend’s birthday, or a website telling you what films are on. Good technology removes friction.

While attending the SURF Research day, I was reminded that this idea of removing friction through technology shouldn’t be limited to consumer or business environments but should also be applied in academic research settings. The day showcased a variety of developments in information technology to help researchers do better research. Because SURF is a Dutch organization there was a particular focus on developments here in the Netherlands.

The day began with a fantastic keynote from Cameron Neylon outlining how networks qualitatively change how research can be communicated. A key point was that to create the best networks we need to make research communication as frictionless as possible.  You can find his longer argument here. After Cameron’s talk, Jos Engelen the chairman of the NWO (the Dutch NSF) gave some remarks. For me, the key take-away was that in every one of the Dutch Government’s 9 Priority Sectors, technology has a central role in smoothing both the research process and its transition to practice.

After the opening session, there were four parallel sessions on text analysis, dealing with data, profiling research, and technology for research education. I managed to attend parts of three of the sessions. In the profiling session, the recently released SURF Report on tracking the impact of scholarly publications in the 21st century, sparked my interest.  Finding new faster and broader ways of measuring impact (i.e. altmetrics)  is a way of reducing friction in science communication. The ESCAPE project showed how enriched publications can make it easy to collate and browse related content around traditional articles. The project won SURF’s enriched publication of the year award. Again, the key, simplifying the research process.  Beyond these presentations, there were talks ranging from making it easier to do novel chemistry to helping religious scholars understand groups through online forms. In each case, the technology was successful because it eliminated friction in the research process.

The SURF research day presented not just technology but how, when it’s done right, technology can make research just a bit smoother.

Filed under: academia, altmetrics Tagged: events, ozdag, surffounation

Source: Think Links

Technology at its best lets us do what we want to do without being held back by time consuming or complex processes. We see this in great consumer technology: your phone giving you directions to the nearest cafe, your calendar reminding you of a friend’s birthday, or a website telling you what films are on. Good technology removes friction.

While attending the SURF Research day, I was reminded that this idea of removing friction through technology shouldn’t be limited to consumer or business environments but should also be applied in academic research settings. The day showcased a variety of developments in information technology to help researchers do better research. Because SURF is a Dutch organization there was a particular focus on developments here in the Netherlands.

The day began with a fantastic keynote from Cameron Neylon outlining how networks qualitatively change how research can be communicated. A key point was that to create the best networks we need to make research communication as frictionless as possible.  You can find his longer argument here. After Cameron’s talk, Jos Engelen the chairman of the NWO (the Dutch NSF) gave some remarks. For me, the key take-away was that in every one of the Dutch Government’s 9 Priority Sectors, technology has a central role in smoothing both the research process and its transition to practice.

After the opening session, there were four parallel sessions on text analysis, dealing with data, profiling research, and technology for research education. I managed to attend parts of three of the sessions. In the profiling session, the recently released SURF Report on tracking the impact of scholarly publications in the 21st century, sparked my interest.  Finding new faster and broader ways of measuring impact (i.e. altmetrics)  is a way of reducing friction in science communication. The ESCAPE project showed how enriched publications can make it easy to collate and browse related content around traditional articles. The project won SURF’s enriched publication of the year award. Again, the key, simplifying the research process.  Beyond these presentations, there were talks ranging from making it easier to do novel chemistry to helping religious scholars understand groups through online forms. In each case, the technology was successful because it eliminated friction in the research process.

The SURF research day presented not just technology but how, when it’s done right, technology can make research just a bit smoother.

Filed under: academia, altmetrics Tagged: events, ozdag, surffounation