Sito di partenza

http://annotation.semanticweb.org

Annotea Project

Overview Annotea is a LEAD (Live Early Adoption and Demonstration) project enhancing the W3C collaboration environment with shared annotations. By annotations we mean comments, notes, explanations, or other types of external remarks that can be attached to any Web document or a selected part of the document without actually needing to touch the document. When the user gets the document he or she can also load the annotations attached to it from a selected annotation server or several servers and see what his peer group thinks. Annotea is open; it uses and helps to advance W3C standards when possible. For instance, we use an RDF based annotation schema for describing annotations as metadata and XPointer for locating the annotations in the annotated document. Annotea is part of the Semantic Web efforts. The annotations are stored in annotation servers as metadata and presented to the user by a client capable of understanding this metadata and capable of interacting with an annotation server with the HTTP service protocol. The first client implementation of Annotea is W3C's Amaya editor/browser. Nothing prevents other clients from implementing these capabilities too. The current Amaya user interface for annotations is presented in the Amaya documentation

Commenti

Questo tool è sviluppato dal W3C, si propone di costruire una infrastruttura basata su RDF per annotare le pagine web facendo uso di server dedicati per mantenere le annotazioni. Le annotazioni sono intese come commenti al contenuto delle pagine annotate. Questi commenti possono essere classificati in diversi modi (commento, correzione, link, etc.). Le annotazioni sono completate con metadati, sempre espressi in triple RDF, quali: autore, risorsa annotata, data, e altri. L’utilità di queste annotazioni risiede nella possibilità fornita ad un utente di leggere le annotazioni presenti in quella pagina fatte da altri utenti. Per ora non hanno previsto un motore di ricerca che sfrutti l’annotazione delle pagine. Un aspetto positivo è che usa solamente degli standard per realizzare questa infrastruttura e che il codice è disponibile per eventuali estensioni. In questa infrastruttura è possibile annotare sia l’intera pagina web, che parti di essa. La fase di annotazione consiste nel selezionare la risorsa da annotare e inserire il testo dell’annotazione (body). Questa annotazione può essere salvata in locale o sul server. Quando un nuovo utente visita la pagina annotata, compaiono delle icone nella pagina che indicano la presenza di annotazioni; cliccandoci sopra è possibile visualizzare l’annotazione. Non vengono menzionate le ontologie. L’annotazione non è fatta rispetto a nessuna ontologia.

http://www.w3.org/2001/Annotea/

Annotation System for Semantic Web

Annotations – An overview Annotation is one of the most common forms of metadata in the context of Semantic Web. Annotations can be comments, notes, explanations, questions, references, examples, advice, correction or any other type of external remark that can be attached to a Web document or a selected part of the document. As they are external, it is possible to annotate any Web document independently, without needing to edit that document. From the technical point of view, annotations are usually seen as metadata, as they give additional information about an existing piece of data. They can be stored either at the user workstation for his private use or at one/more common servers. Annotations stored in public servers can be made available to anybody with proper authentication. By this way everybody can share their views/comments about a particular web site regarding its content, usefulness, etc., Since anybody can make annotations in any page, the visitors of the site can get a wide range of views – without being restricted to the views of the author of the page alone, as in the case of META tags. Also the META tags applies, in general to the whole page – whereas annotations can be made for a particular section/text in the page.

Our Annotation System is based on a three-tiered architecture.

A client that can be used for making/searching for annotations.
An application server which handles all requests from client.
A database server for storing the annotations made by the users. Having realized the close coupling of the annotation with the Internet, our idea is to develop the client as a component so that we can plug-in in the Internet Browsers. And our choice was Microsoft Internet Explorer, the most popular browser. The client also supports storing annotations in the local machine as an alternative. The Annotation client essentially has two components.
A Toolbar (like the Google Toolbar^TM).
A Browser Bar (like the "History" or "Favourites" explorer bars in Internet Explorer ).

http://www.ncb.ernet.in/groups/dake/annotate/index.shtml

Commenti

Questo tool sembra ben integrato nell’ambiente del browser. L’annotazione avviene selezionando il testo “interessante” e riempiendo il form che viene proposto con tutte le informazioni richieste. Tra queste informazioni c’è il testo vero e proprio dell’annotazione. L’annotazione rimane visibile nella pagina grazie alla presenza di un piccolo simbolo a forma di matita.

MnM: Ontology Driver Semi-Automatic and Automatic Support for Semantic Markup

MnM is an annotation tool which provides both automated and semi-automated support for annotating web pages with semantic contents. MnM integrates a web browser with an ontology editor and provides open APIs to link to ontology servers and for integrating information extraction tools.

http://kmi.open.ac.uk/projects/akt/MnM/index.html

Commenti

Questo progetto sembra molto interessante. E’ molto recente, infatti solo in questi giorni hanno creato la home page del progetto ed è in rapida evoluzione. Le cose più interessanti sono:

-possibilità di annotare rispetto ad ontologie ben precise;

-possibilità di navigare le varie ontologie rispetto a cui annotare i documenti;

-possibilità di annotare automaticamente dei documenti dopo una fase di “learning”;

-controllo di validità dei risultati rispetto alle ontologie usate. La filosofia e il concetto di Semantic Web è interessante ed in un qualche modo si avvicina all’idea del Sewasie. Gli autori del progetto dicono che il loro sistema è simile ad OntoMat (vedi dopo). Il tool richiede diversi pacchetti software per funzionare (per adesso funziona solo su Windows 2000). Questo tool fa uso di Annie e Amilcare, per trovare i token dei documenti (parole, frasi, etc.) e per indurre le regole di estrazione. In futuro prevedono di utilizzare l’annotazione così creata per migliorare le ricerche facendo anche uso di agenti.

OntoMat-Annotizer

OntoMat-Annotizer is a user-friendly interactive webpage annotation tool. It supports the user with the task of creating and maintaining ontology-based DAML+OIL markups i.e. creating of DAML-instances, attributes and relationships. It include an ontology browser for the exploration of the ontology and instances and a HTML browser that will display the annotated parts of the text. It is Java-based and provide a plugin interface for extensions. The intended user is the individual annotator i.e., people that want to enrich their web pages with DAML-meta data. Instead of manually annotating the page with a text editor, say, emacs, OntoMat allows the annotator to highlight relevant parts of the web page and create new instances via drag’n’drop interactions. It supports the meta-data creation phase of the lifecycle. It is planned that a future version will contain an information extraction plugin, that offers a wizard which suggest which parts of the text are relevant for annotation. That aspect will help to ease the time-consuming annotation task.

Commenti

OntoMat è l’implementazione di riferimento di CREAM. CREAM è il framework per l’annotazione studiato all’università di Karlsruhe. C’è una versione di CREAM – S-CREAM – che consente di estrarre le informazioni in modo automatico. Per estrarre automaticamente le informazioni occorre prima eseguire una fase di addestramento del tool che esegue l’estrazione. L’utente deve fornire un set di documenti (simili) e annotarli in modo opportuno prima di poter iniziare la fase di “learning”. L’output di questa fase è una serie di regole di inferenza che verranno poi usate dal tool per estrarre la conoscenza dai documenti sui quali verrà invocato. Le informazioni estratte possono essere rifinite. Per estrarre le informazioni usano due strumenti: Annie e Amilcare, per trovare i token dei documenti (parole, frasi, etc.) e per indurre le regole di estrazione.

L’annotazione che può anche essere fatta manualmente, fa riferimento ad una o più ontologie.

http://annotation.semanticweb.org/tools/ontomat

SMORE: Semantic Markup, Ontology and RDF Editor

SMORE is a tool that allows users to markup their documents in RDF using web ontologies in association with user- specific terms and elements. The aim of this software is as follows:

To provide the user with a flexible environment in which he can create his web page without too many hindrances involving markup
To allow the user to markup his document with minimal knowledge of RDF terms and syntax. However, the user should be able to semantically classify his data set for annotation i.e. breakup sentences into the basic subject-predicate-object model
To provide a reference to existing ontologies on the Internet in order to use more precise references in his own web page/text. The user can also create his own ontology from scratch and borrow terms from existing ontologies
To ensure accurate and complete RDF markup with scope to make modifications easily

http://www.mindswap.org/~aditkal/editor.shtml

Commenti

Questo tool è sviluppato all’interno del progetto MINDSWAP. Consente di annotare anche immagini (o parti di esse) – PhotoSMORE. Vuole ridurre la differenza tra la creazione dei contenuti (creazione di pagine web – authoring) e la fase di annotazione. SMORE ha all’interno un html editor. Il processo di annotazione è più o meno simile agli altri: si seleziona una parte del testo e si creano delle triple (tipo RDF). Gli elementi delle triple possono essere associati agli elementi di ontologie anche in un momento successivo alla creazione della tripla stessa. C’è un componente – MailSMORE – che consente di arricchire le mail scritte rispettando certe strutture, di triple collegate ad ontologie relative alla posta elettronica. Questo framework consente di modificare, navigare, cercare ontologie, aiutando in questo modo la ricerca dell’ontologia più adatta da associare all’annotazione che si sta effettuando. E’ inoltre possibile cercare dinamicamente l’ontologia che più si addice al contesto specifico. C’è un sistema che inferisce alcuni tipi di triple e le propone all’utente che può eventualmente cancellarle. Scraper è un modulo che consente di analizzare il contenuto di pagine web alla ricerca di strutture predefinite quali tabelle, campi, liste e creare automaticamente triple RDF. C’è un sistema per la gestione delle ontologie – Parka-DB – che si affida ad un DBMS e gestisce le triple RDF per fare inferenza.

http://www.mindswap.org/.