DBpedia: Glue for all Wikipedias and a Use Case for Multilingualism

0 %
100 %
Information about DBpedia: Glue for all Wikipedias and a Use Case for Multilingualism
Technology

Published on May 12, 2014

Author: MarcoFossati

Source: slideshare.net

Description

Talk given at the 7th W3C Multilingual Web Workshop

Glue for all Wikipedias and a Use Case for Multilingualism Dbpedia: Marco Fossati Mariano Rico Martin Brümmer

Extracting knowledge from Wikipedia Dbpedia: Martin Brümmer bruemmer@informatik.uni-leipzig.de

why? Turn documents into data to granularly use and query it How? Mapping wikipedia data to Linked data Result: Multilingual data with a common structure

Multilingual community Guarantee data quality and coverage beyond language borders Organized in chapters 14 Language communities maintaining their language dbpedias Supported by DBpedia association Opening the research project for long-term sponsoring

The center of the lod cloud

Extracting multilingual knowledge Internationalization English Mapping Dbpedia.org

Extracting multilingual knowledge Internationalization

Extracting multilingual knowledge Internationalization

Extracting multilingual knowledge Internationalization

Extracting multilingual knowledge Internationalization Mapped by chapters $lang.Dbpedia.org

USE CASES dbpedia internationalization

Abbrev. base industrial use case

why? Help in text segmentation in the form of exceptions to segmentation rules what? Multilingual Knowledge base of abbreviations how? Extract words that look like sentence boundaries, model via Lemon

T H E I T A L I A N J O B D B P E D I A I N T E R N A T I O N A L I Z A T I O N Marco Fossati fossati@fbk.eu

B U I L D I N G H U G E G A Z E T T E E R S I N D U S T R I A L U S E C A S E

W H Y ? natural language understanding W H A T ? linguistic resource language, domain-specific H O W ? the simplest query

T H E O P E N D A T A L A N D S C A P E U S E R S

O P E N C O E S I O N E . G O V . I T open government

F L O R E N C E N A T I O N A L L I B R A R Y digital libraries

I N F O G R A P H I C S data-driven journalism

S T U D E N T S L E A R N H O W T O T R A N S L A T E A C U L T U R E T H E F I R S T I T A L I A N D B P E D I A M A P P I N G S P R I N T

W H Y ? High quality, multilingual data W H A T ? mapping italian data to the dbpedia ontology H O W ? hackathon in a high school

T H E S P A N I S H A P A R T M E N T A N D N O W … Marco Fossati fossati@fbk.eu

T H E S P A N I S H J O B D B P E D I A I 1 8 N Mariano.Rico@upm.es

T H E S P A N I S H J O B M E X I C A N A R G E N T I N I A N C O L O M B I A N … ( U P T O 2 2 ) D B P E D I A I 1 8 N Mariano.Rico@upm.es

W I K I P E D I A L A N G U A G E S Ranking: (As of 29th Jan. 2014) 1.- English (4.4 M) 2.- German (1.7 M) 3.- French (1.5 M) 4.- Italian (1.1M) Russian (1.1M) Spanish (1.1M) Polish (1.1M) 5.- Japanese (0.9) 6.- Portuguese (0.8M) 7.- Chinese (0.8M)

M A P P I N G R A C E 2 0 1 1 ESDBPEDIA HACKATON ( N O V . 2 0 1 1 ) 15 PEOPLE 4H 4H 101 CLASSES MAPPED 8 0 % I N S T A N C E S M A P P E D

E N G L I S H H U M A N S E S D B P E D I A : T H E W E B S I T E

S P A N I S H H U M A N S E S D B P E D I A : T H E W E B S I T E

L O C A T I O N S E S D B P E D I A : T H E W E B S I T E

L O C A T I O N S E S D B P E D I A : T H E W E B S I T E English (browser) users: 16% (2091 in 12686) Spanish (browser) users: 78%% (10048 in 12686) No es| No en (browser) users: 5%

S P A R Q L Q U E R I E S E S D B P E D I A : T H E S P A R Q L E N D P O I N T Up to 350,000 sparql queries per day 22M SPARQL queries FROM 2200 IPs

S P A R Q L Q U E R I E S E S D B P E D I A : T H E S P A R Q L E N D P O I N T 22M SPARQL queries FROM 2200 IP IPs with more than 103 requests: 60 IPs with requests between 103 and 10: 440 IPs with <less than 10 requests: 1700

N O I S E G E N E R A T O R S E S D B P E D I A : T H E S P A R Q L E N D P O I N T 22M SPARQL queries FROM 2200 IP 2012 9-month queries

L E S S O N S L E A R N T Lesson 1 Take care of IP monsters

L E S S O N S L E A R N T Lesson 2 Take care of NOISE GENERATORS

H T T P : / / D B P E D I A . O R G Mariano.Rico@upm.es T h a n k s f o r y o u r a t t e n t i o n ! fossati@fbk.eu bruemmer@informatik.uni-leipzig.de

Add a comment

Related presentations

Presentación que realice en el Evento Nacional de Gobierno Abierto, realizado los ...

In this presentation we will describe our experience developing with a highly dyna...

Presentation to the LITA Forum 7th November 2014 Albuquerque, NM

Un recorrido por los cambios que nos generará el wearabletech en el futuro

Um paralelo entre as novidades & mercado em Wearable Computing e Tecnologias Assis...

Microsoft finally joins the smartwatch and fitness tracker game by introducing the...

Related pages

DBpedia

Use Case Support Wikipedia Authors with ... Extracting structured data from all 251 versions of DBpedia and interlinking this data with background ...
Read more

DBpedia

Projects & Use Cases. Projects; ... This DBpedia release is based on updated Wikipedia dumps dating from October ... A belated Happy New Year to all ...
Read more

DBpedia Mappings

DBpedia Mappings Wiki. ... dbpedia:Vince_Vaughn rdf: ... Correct the mapping to use the expected mapToClass In all cases, ...
Read more

Wikipedia - Wikipedia, the free encyclopedia

... including all Wikipedias, ... In certain cases, all editors ... stating that academics who endorse the use of Wikipedia are "the intellectual ...
Read more

Elmer's Glue-All Multi-Purpose Adhesive | Elmer's Glue

Elmer's Glue-All Multi-Purpose Glue is stronger than ever before. It is perfect for household repairs, craft and school projects. Projects; Printables ...
Read more

Help:Infobox - Wikipedia, the free encyclopedia

Help:Infobox; Cleanup; ... By browsing the set of all infoboxes via Wikipedia: ... Parameters are case sensitive. Nearly all infoboxes use lowercase ...
Read more

Querying DBpedia - bobdc.blog - The Snee Group

Querying DBpedia. 9 November 2007. And ... so I thought I'd describe here how I successfully implemented my first use case. ... The Simpson episode ...
Read more

Titebond II Premium Wood Glue - Titebond - The Most ...

Titebond II Premium Wood Glue. ... Do not use when temperature, glue or materials are below 55°F. Due to low pH, ... Case UPC Units Per Package ...
Read more

Krazy Glue | Krazy Strong, Fast-Drying Glues that Create ...

Super glue that is krazy strong, krazy fast. It works in as little as 30 seconds, forming an extremely strong bond on all kinds of surfaces
Read more