Difference between revisions of "PundIt MarineLives Forum"

From MarineLives
Jump to: navigation, search
m
Line 16: Line 16:
 
The DM2E project is a project of [http://en.wikipedia.org/wiki/Europeana Europeana], which has emerged out of the European Digital Library Network.
 
The DM2E project is a project of [http://en.wikipedia.org/wiki/Europeana Europeana], which has emerged out of the European Digital Library Network.
  
MarineLives is working with Dr Christian Morbidioni and Dr Simone Fonda of the University of Pisa to explore a working demo of PundIt. [https://docs.google.com/document/d/1y2j_6sYb2FoTEKlwXEg3rO7T8CwbyeQvy1sSKxyYEic/ The demo can be found at here].
+
MarineLives is working with [http://www.linkedin.com/pub/christian-morbidoni/0/97a/475 Dr Christian Morbidioni] of the University of Pisa and [http://lodlam.net/members/pundit/profile/ Simone Fonda] of Net7 to explore a working demo of PundIt. [https://docs.google.com/document/d/1y2j_6sYb2FoTEKlwXEg3rO7T8CwbyeQvy1sSKxyYEic/ The demo can be found here].
 +
 
 +
In parallel, MarineLives is working with Dr Christian Eckert and Dominique Ritze of the University of Mannheim, to explore the potential for automatic and semi-automatic entity recognition for MarineLives transcriptions.  The topics of semantic annotation and entity recognition are clearly closely related.
  
In parallel, MarineLives is working with Dr Christian Eckert and Dominique Ritze of the University of Mannheim, to explore the potential for automatic and semi-automatic entity recognition for MarineLives transcriptions.  The topics of semantic annotation and entity recognition are clearly closely related
+
----
 +
'''Approach to evaluation'''
 +
 
 +
We would like to focus our experimentation with PundIt on the High Court of Admiralty deposition book, HCA 13/72.  Roughly 700 pages of HCA 13/72 have been transcribed and edited, and are available in edited form on the following [[HCA 13/72: |Annotate HCA 13/72 wiki]]. Digital images of many (but not all) of the same transcribed pages can be viewed in our tailored transcription software, [http://marinelives-transcript.org/scripto/category/hca-1372/page/2/ MarineLives - Transcript] together with the transcriptions.
 +
 
 +
We suggest that evaluators try annotating web pages from both the wiki version of the transcribed text and the MarineLives - Transcript version of the transcibed text (and indeed the images themselves, and or image fragments).
 +
 
 +
The current PundIt demo has been set up with sample custom voccabularies extracted from the Annotate HCA 13/71 wiki.  Colin Greenstreet is exploring with Simone Fonda how we can create new custom vocabularies for ships, people, places and materials specifically for HCA 13/72, and how we can then add and edit new individual records in these custom vocabularies.
 
----
 
----
 
__TOC__
 
__TOC__
Line 26: Line 35:
 
[http://dm2e.eu/ DM2E]
 
[http://dm2e.eu/ DM2E]
 
[http://www.europeana.eu/ Europeana]
 
[http://www.europeana.eu/ Europeana]
 +
 +
[[HCA 13/72: |Annotate HCA 13/72 wiki]]
 +
[http://marinelives-transcript.org/scripto/category/hca-1372/page/2/ MarineLives - Transcript: HCA 13/72 pages]
 +
 +
----
 +
==Rolling list of questions about PundIt functionality in the context of MarineLives project==

Revision as of 09:13, October 30, 2013

PundIt MarineLives Forum

Editorial history

30/10/13: CSG, created page



Purpose of this page

This page provides a discussion forum and set of resources for MarineLives project members exploring the functionality of the PundIt tool.


Background

PundIt is an experimental semantic annotation tool for web pages which is currently under further development by Net7, and which is being used by the DM2E project.

The DM2E project is a project of Europeana, which has emerged out of the European Digital Library Network.

MarineLives is working with Dr Christian Morbidioni of the University of Pisa and Simone Fonda of Net7 to explore a working demo of PundIt. The demo can be found here.

In parallel, MarineLives is working with Dr Christian Eckert and Dominique Ritze of the University of Mannheim, to explore the potential for automatic and semi-automatic entity recognition for MarineLives transcriptions. The topics of semantic annotation and entity recognition are clearly closely related.



Approach to evaluation

We would like to focus our experimentation with PundIt on the High Court of Admiralty deposition book, HCA 13/72. Roughly 700 pages of HCA 13/72 have been transcribed and edited, and are available in edited form on the following Annotate HCA 13/72 wiki. Digital images of many (but not all) of the same transcribed pages can be viewed in our tailored transcription software, MarineLives - Transcript together with the transcriptions.

We suggest that evaluators try annotating web pages from both the wiki version of the transcribed text and the MarineLives - Transcript version of the transcibed text (and indeed the images themselves, and or image fragments).

The current PundIt demo has been set up with sample custom voccabularies extracted from the Annotate HCA 13/71 wiki. Colin Greenstreet is exploring with Simone Fonda how we can create new custom vocabularies for ships, people, places and materials specifically for HCA 13/72, and how we can then add and edit new individual records in these custom vocabularies.






Suggested links


DM2E
Europeana

Annotate HCA 13/72 wiki
MarineLives - Transcript: HCA 13/72 pages



Rolling list of questions about PundIt functionality in the context of MarineLives project