Wednesday, September 19, 2018 - 8:53am
More than 110 DBpedia enthusiasts joined the Community Meeting in Vienna.
After the success of the last two community meetings in Amsterdam and Leipzig, we thought it is time to meet you at the SEMANTiCS conference again. This year’s SEMANTiCS opened with the DBpedia Day on September 10th, 2018 in Vienna.
First and foremost, we would like to thank the Institute for Applied Informatics for supporting our community and many thanks to the Technical University Vienna and the SEMANTiCS for hosting our community meeting.
Javier David Fernández García, Vienna University of Economics, opened the meeting with his keynote Linked Open Data cloud – act now before it’s too late. He reflected on challenges towards arriving at a truly machine-readable and decentralized Web of Data. Javier reviewed the current state of affairs, highlighted key technical and non-technical challenges, and outlined potential solution strategies.
The second keynote speaker was Mathieu d’Aquin, Professor of Informatics at the Insight Centre for Data Analytics at NUI Galway. Mathieu, who is specialized in data analytics, completed the meeting with his keynote Dealing with Open Domain Data.
Patrik Schneider started the DBpedia Showcase Session with his presentation of the “NII (Japan) Research Showcase – A Knowledge Graph Management Framework for DBpedia”. Shortly after, Jan Forberg, from AKSW/KILT Leipzig, promoted the usage of WebIDs in a short how-to tutorial session. Adam Sanchez, from University Grenoble Alpes, talked about RDFization of a relational database from medicine domain by using Ontop. Followed by another presentation by Beyza Yaman, University of Genoa, talking about Exploiting Context-Dependent Quality Metadata for Linked Data Source Selection. Afterwards, Robert Bielinski, from AKSW/KILT Leipzig, introduced the new DBpedia release circle by using Apache Spark. Closing the Showcase Session, Tomas Kliegr, University of Economics Prague, presented a showcase using DBpedia to study cognitive biases affecting interpretation of machine learning results.
For further details of the presentations follow the links to the slides.
- WebID Creation by Jan Forberg, AKSW/KILT slides
- RDFization by Adam Sanchez, Université Grenoble Alpes slides
- Exploiting Context-Dependent Quality Metadata by Beyza Yaman, University of Genoa slides
- Extracting Data using Apache Spark by Robert Bielinski, AKSW/KILT slides
- Using DBpedia to study cognitive biases affecting interpretation of machine learning results by Tomas Kliegr, University of Economics Prague slides
As a regular part of the DBpedia Community Meeting, we had two parallel sessions in the afternoon where DBpedians can discuss technical issues. Participants interested in NLP-related topics joined the NLP & DBpedia session. Milan Dojchinovski (AKSW/KILT) chaired this session with four very stimulating talks. Hereafter you will find all presentations given during this session:
- Enriching DBpedia by Knowledge Base Population and Dark Entity Resolution by Key-Sun Choi, KAIST, Korea slides
- NED for Cultural Heritage using DBpedia by Gary Munnelly, Trinity College Dublin slides
- Entity Linking using MAG by Diego Moussallem, AKSW, Leipzig University slides
- Temporal Role of Named Entities with help of DBpedia by Maria Koutraki, FIZ Karlsruhe slides
At the same time, the DBpedia Association Hour provided a platform for the community to discuss technical questions and especially the DBpedia databus. Sebastian Hellmann presented the DBpedia databus and explained the advantages of global IDs. Shortly after, Marvin Hofer (AKSW/KILT) demonstrated the new DBpedia global ID webinterface. Please find his slides here.
The 12th edition of the DBpedia Community Meeting also covered a special chapter session, chaired by Enno Meijers, from the Dutch DBpedia Language Chapter. The speakers presented the latest technical or organizational developments of their respective chapter.
Following, you find a list of all presentations of this session:
- Dutch DBpedia Language Chapter by Enno Meijers, National Library of the Netherlands slides
- Spanish DBpedia Chapter by Mariano Rico, Technical University of Madrid (UPM) slides
- Portuguese DBpedia Chapter by Diego Moussallem, AKSW slides
- French DBpedia Chapter by Elmahdi Korfed, INRIA slides
- Catalan DBpedia Chapter by Jens Grivolla, Pompeu Fabra University slides
- Czech DBpedia Chapter by Milan Dojchinovski, AKSW/KILT slides
This session has mainly created an exchange platform for the different DBpedia chapters. For the first time, representatives of the European chapters discussed problems and challenges of DBpedia from their point of view. Furthermore, tools, applications and projects were presented by each chapter.
Summing up, the 12th DBpedia Community Meeting brought together more than 110 DBpedia enthusiasts from Europe who engaged in vital discussions about Linked Data, the DBpedia databus as well as DBpedia use cases and services.
Thursday, September 13, 2018 - 11:04am
While everyone at the DBpedia Association was preparing for the SEMANTiCS Conference in Vienna, we also managed to reach an important milestone regarding the beta-test for our data release tool.
First and foremost, already 3500 files have been published with the plugin. These files will be part of the new DBpedia release and are available on our LTS repository.
Now we have some time to support you and work one on one and also prepare the configurations to help you set up the data releases. Lastly, we already received data from DNB and SUMO, so we will start to look into these more closely.
Thanks to all the beta-testers for your nice work.
We keep you posted.
Wednesday, August 22, 2018 - 12:33pm
This year’s GSoC is slowly coming to an end with final evaluations already being submitted. In order to bridge the waiting time until final results are published, we like to draw your attention to a former project and great tool that was developed during last years’ GSoC.
DBpedia Chatbot is a conversational Chatbot for DBpedia which is accessible through the following platforms:
- A Web Interface
- Facebook Messenger
The bot is capable of responding to users in the form of simple short text messages or through more elaborate interactive messages. Users can communicate or respond to the bot through text and also through interactions (such as clicking on buttons/links). There are 4 main purposes for the bot. They are:
- Answering factual questions
- Answering questions related to DBpedia
- Expose the research work being done in DBpedia as product features
- Casual conversation/banter
The bot tries to answer text-based questions of the following types:
Natural Language Questions
- Give me the capital of Germany
- Who is Obama?
- Where is the Eiffel Tower?
- Where is France’s capital?
Users can ask the bot to check if vital DBpedia services are operational.
- Is DBpedia down?
- Is lookup online?
Users can ask basic information about specific DBpedia local chapters.
- DBpedia Arabic
- German DBpedia
These are predominantly questions related to DBpedia for which the bot provides predefined templatized answers. Some examples include:
- What is DBpedia?
- How can I contribute?
- Where can I find the mapping tool?
Messages which are casual in nature fall under this category. For example:
- What is your name?
if you like to have a closer look at the internal processes and how the chatbot was developed, check out the DBpedia GitHub pages.
DBpedia Chatbot was published on wiki.dbpedia.org and is one of many other projects and applications featuring DBpedia.
Powered by WPeMatico
In case you want your DBpedia based tool or demo to publish on our website just follow the link and submit your information, we will do the rest.
Tuesday, August 14, 2018 - 3:07pm
Today we are featuring DBpedia Entity, in our blog series of introducting interesting DBpedia applications and tools to the DBpedia community and beyond. Read on and enjoy.
DBpedia-Entity is a standard test collection for entity search over the DBpedia knowledge base. It is meant for evaluating retrieval systems that return a ranked list of entities (DBpedia URIs) in response to a free text user query.
The first version of the collection (DBpedia-Entity v1) was released in 2013, based on DBpedia v3.7 . It was created by assembling search queries from a number of entity-oriented benchmarking campaigns and mapping relevant results to DBpedia. An updated version of the collection, DBpedia-Entity v2, has been released in 2017, as a result of a collaborative effort between the IAI group of the University of Stavanger, the Norwegian University of Science and Technology, Wayne State University, and Carnegie Mellon University . It has been published at the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’17), where it received a Best Short Paper Honorable Mention Award. See the paper and poster.
DBpedia Entity was published on wiki.dbpedia.org and is one of many other projects and applications featuring DBpedia.
Powered by WPeMatico
The post DBpedia Entity – Standard Test Collection for Entity Search over DBpedia appeared first on DBpedia Blog.
Tuesday, August 7, 2018 - 10:56am
Finally, we are proud to announce that the beta-testing of our data release tool for data releases on the DBpedia Databus is about to start.
In the past weeks our developers at DBpedia have been devloping a new data release tool to release datasets on the DBpedia Databus. In that context we are still looking for beta-testers who have a dataset they wish to release. Sign up here and benefit from an increased visibility for your dataset and your work done.
We are now preparing the first internal test with our own dataset to ensure the data release tool is ready for the testers. During the testing process, beta-testers will discuss occuring problems, challenges and ideas for improvement via the DBpedia #releases channel on Slack to profit from each other’s knowledge and skills. Issues are documented via GitHub.
The whole testing process for the data release tool follows a 4-milestones plan:
Milestone One: Every tester needs to have a WebID to release data on the DBpedia Databus. In case you are interested in how to set up a WebID, our tutorial will help you a great deal.
Milestone Two: For their datasets, testers will generate DataIDs, that provide detailed descriptions of the datasets and their different manifestations as well as relations to agents like persons or organizations, in regard to their rights and responsibilities.
Milestone Three: This milestone is considered as achieved, if an RSS feed feature can be genreated. Additionally, bugs, that arose during the previous phases should have been fixed. We also want to collect the testers particular demands and wishes that would benefit the tool or the process. A second release can be attempted to check how integrated fixes and changes work out.
Milestone Four: This milestone marks the final upload of the dataset to the DBedia Databus which is hopefully possible in about 3 weeks.
In case you want to get one of the last spots in the beta-testing team, just sign up here and get yourself a WebID and start testing.
Looking forward to working with you…
Tuesday, July 17, 2018 - 10:23am
We are happy to announce that the 12th DBpedia Community Meeting will be held in Vienna, Austria. At the beginning of SEMANTiCS 2018, Sep 10-13, the DBpedia Community will get together on the 10th of September for the DBpedia Day.
– Keynote presentation by Javier David Fernández García (WU Vienna)
– Keynote presentation by Mathieu d’Aquin (NUI Galway)
– DBpedia Association hour
– DBpedia Chapter Session
– Tell us what cool things you do with DBpedia: https://goo.gl/forms/ngRWCjgH9ocDCrEb2
– Web URL: http://wiki.dbpedia.org/meetings/Vienna2018
– Hashtag: #DBpediaDay
– When: September 10th, 2018
– Where: Gußhaus Campus of Vienna’s Technical University, Gußhausstraße 27-29, 1040 Vienna, Austria
– Call for Contribution: Submit your proposal in our form.
– Attending the DBpedia Community Meeting costs €50 (excl. registration fee and VAT). DBpedia members get free admission, please contact your nearest DBpedia chapter or the DBpedia Association for a promotion code.
– You need to buy a ticket. Please check all details here: https://2018.semantics.cc/registration
Please check our schedule for the 12th DBpedia Community meeting here: https://wiki.dbpedia.org/meetings/Vienna2018
– When: September 9th, 2018
– Where: SBA Research, Favoritenstraße 16, 1040 Vienna, Austria
– What: We will discuss the development strategy of the DBpedia Association with members of the DBpedia chapters. You are cordially invited to participate in the discussion to shape the strategy of DBpedia.
– Registration can be made by email.
Sponsors and Acknowledgments
– Technical University Wien (https://www.tuwien.ac.at/en/)
– Institute for Applied Informatics (https://infai.org/)
– OpenLink Software (http://www.openlinksw.com/)
– SEMANTiCS Conference Sep 10-13, 2018 in Vienna (https://2018.semantics.cc/)
– SBA Research (https://www.sba-research.org/)
In case you want to sponsor the 12th DBpedia Community Meeting, please contact the DBpedia Association via firstname.lastname@example.org.
– Julia Holze, DBpedia Association
– Sebastian Hellmann, AKSW/KILT, DBpedia Association
We are looking forward to meeting you in Vienna!
Your DBpedia Association
The post Call for Participation: 12th DBpedia Community Meeting in Vienna appeared first on DBpedia Blog.
Friday, July 6, 2018 - 11:56am
Rencontre avec les français DBpédiens à Lyon
In cooperation with Thomas Riechert (HTWK/InfAI), the DBpedia Association organized our second DBpedia meetup this year, this time in Lyon. On July 3rd, 2018, we met the French DBpedia Community at the ENS in person and presented the vision of the new DBpedia Databus, an opportunity which simplifies the work with data.
First and foremost, we would like to thank the Institute for Applied Informatics for supporting our community and the LARHRA Laboratory as well as the ENS for hosting our community meetup. Special thanks go to Thomas Riechert and Vincent Alamercery (LARHRA Lyon) for organizing the event.
Sebastian Hellmann (AKSW/KILT) opened up the meetup in Lyon by introducing the DBpedia development strategy and the new DBpedia Databus to the French DBpedia community (slides). Afterwards, Elmahdi Korfed from INRIA presented new features and tools as results developed in the French DBpedia chapter (slides):
In the following months, Elmahdi plans to work on the DBpedia historic live version and the DBpedia wiki commons. His research will be presented during our 12th DBpedia Community meeting on September 10th, in Vienna.
Following Elmahdi, Francesco Beretta presented LARHRA laboratory and its different research areas. In particular, he introduced the Data for History Consortium which is an international consortium founded in 2017 with the aim of improving geo-historical data interoperability in the semantic web.
The afternoon track started out with an inspiring presentation by Adam Sanchez from the University of Grenoble. He talked about ‘RDFization of a relational database from medicine domain using Ontop’ (slides) and introduced the Ontop mappings. Afterwards, Oscar Rodríguez Rocha (University of Côte d’Azur) showcased the application ‘Automatic Generation Educational Quizzes’ from DBpedia (slides) and explained how the automatic generation of quizzes works based on the game Les Incollables.
The meeting concluded with a dynamic discussion on the DBpedia Databus and potential collaborations between the DBpedia Association and the French DBpedia Chapter.
You still can’t get enough of DBpedia?
Don’t worry, we already have another meeting of the DBpedia community in the pipeline. Our 12th DBpedia Community meeting is scheduled for September 10th and preparations on the program are already in full swing. Our DBpedia Day will kick-off this year’s edition of SEMANTiCS 2018, hosted at TU Vienna and brings the European DBpedia community together.
You want to contribute? Please submit your proposal and be a part of our amazing program. Register here and meet us and other DBpedia enthusiasts in Vienna. We are looking forward to your contribution.
See you soon!
The post French DBpedia enthusiasts joined the meetup in Lyon. appeared first on DBpedia Blog.
Wednesday, June 27, 2018 - 10:31am
Unfortunately, with the new GDPR, we experienced some trouble with our Blog. That is why this post is published a little later than anticipated.
There you go.
With our new strategic orientation and the emergence of the DBpedia Databus, we wanted to meet some DBpedia enthusiasts of the German DBpedia Community.
The recently hosted 6th LSWT (Leipzig Semantic Web Day) on June 18th, was the perfect platform for DBpedia to meet with researchers, industry and other organizations to discuss current and future developments of the semantic web.
Under the motto “Linked Enterprises Data Services”, experts in academia and industry talked about the interlinking of open and commercial data of various domains such as e-commerce, e-government, and digital humanities.
Sören Auer, DBpedia endorser and board member as well as director of TIB, the German National Library of Science and Technology, opened the event with an exciting keynote. Recapping the evolution of the semantic and giving a glimpse into the future of integrating more cognitive processes into the study of data, he highlighted the importance of AI, deep learning, and machine learning. They are as well as cognitive data, no longer in their early stages but advanced to fully grown up sciences.
Shortly after, Sebastian Hellmann, director of the DBpedia Association, presented the new face of DBpedia as a global open knowledge network. DBpedia is not just the most successful open knowledge graph so far, but also has a deep inside knowledge about all connected open knowledge graphs (OKG) and how they are governed.
With our new credo connecting data is about linking people and organizations, the global DBpedia platform aims at sharing efforts of OKG governance, collaboration, and curation to maximize societal value and develop a linked data economy.
The DBpedia Databus functions as Metadata Subscription Repository, a platform that allows exchanging, curate and access data between multiple stakeholders. In order to maximize the potential of your data, data owners need a WebID to sign their Metadata with a private key in order to make use of the full Databus services. Instead of one huge monolithic release every 12 months the Databus enables easier contributions and hence partial releases (core, mapping, wikidata, text, reference extraction) at their own speed but in much shorter intervals (monthly). Uploading data on the databus means connecting and comparing your data to the network. We will offer storage services, free & freemium services as well as data-as-a-service. A first demo is available via http://downloads.dbpedia.org/databus
During the lunch break, LSWT participants had time to check out the poster presentations. 4 of the 18 posters used DBpedia as a source. One of them was Birdory, a memory game developed during the Coding Da Vinci hackathon, that started in April 2018. Moreover, other posters also used the DBpedia vocabulary.
In the afternoon, participants of LSWT2018 joined hands-on tutorials on SPARQL and WebID. During the SPARQL tutorial, ten participants learned about the different query types, graph patterns, filters, and functions as well as how to construct SPARQL queries step by step with the help of a funny Monty Python example.
Afterwards, DBpedia hosted a hands-on workshop on WebID, the password-free authentication method using semantics. The workshop aimed at enabling participants to set up a public/private key, a certificate, and a WebID. Everything they needed to bring was a laptop and an own webspace. Supervised by DBpedia’s executive director Dr. Sebastian Hellmann and developer Jan Forberg, people had to log-into a test web service at the end of the session, to see if everything worked out. All participants seemed well satisfied with the workshop – even if not everyone could finish it successfully they got a lot of individual help and many hints. For support purposes, DBpedia will stay close in touch with those participants.
We are currently looking forward to our next DBpedia meetup in Lyon, France on July 3rd and the DBpedia Day co-located with Semantics 2018 in Vienna. Contributions to both events are still welcome. Send your inquiry to email@example.com.
Wednesday, June 27, 2018 - 10:01am
NLI-GO DBPedia demo was published on wiki.dbpedia.org and is one of many other projects and applications featuring DBpedia.
Powered by WPeMatico
Tuesday, May 15, 2018 - 5:12pm
Working with data is hard and repetitive. That is why we are more than happy to announce the launch of the alpha version of our DBpedia Databus, a way that simplifies working with data.
We have studied the data network for already 10 years and we conclude that organizations with open data are struggling to work together properly. Even though they could and should collaborate, they are hindered by technical and organizational barriers. They duplicate work on the same data. On the other hand, companies selling data cannot do so in a scalable way. The consumers are left empty-handed and trapped between the choice of inferior open data or buying from a jungle-like market.
We need to rethink the incentives for linking data
We envision a hub, where everybody uploads data. In that hub, useful operations like versioning, cleaning, transformation, mapping, linking, merging, hosting are done automagically on a central communication system, the bus, and then again dispersed in a decentral network to the consumers and applications. On the Databus, data flows from data producers through the platform to the consumers (left to right), any errors or feedback flows in the opposite direction and reaches the data source to provide a continuous integration service and improves the data at the source.
The DBpedia Databus is a platform that allows exchanging, curating and accessing data between multiple stakeholders. Any data entering the bus will be versioned, cleaned, mapped, linked and its licenses and provenance tracked. Hosting in multiple formats will be provided to access the data either as dump download or as API.
Publishing data on the Databus means connecting and comparing your data to the network
If you are grinding your teeth about how to publish data on the web, you can just use the Databus to do so. Data loaded on the bus will be highly visible, available and queryable. You should think of it as a service:
- Visibility guarantees, that your citations and reputation goes up.
- Besides a web download, we can also provide a Linked Data interface, SPARQL-endpoint, Lookup (autocomplete) or other means of availability (like AWS or Docker images).
- Any distribution we are doing will funnel feedback and collaboration opportunities your way to improve your dataset and your internal data quality.
- You will receive an enriched dataset, which is connected and complemented with any other available data (see the same folder names in data and fusion folders).
How it works at the moment
Integration of data is easy with the Databus. We have been integrating and loading additional datasets alongside DBpedia for the world to query. Popular datasets are ICD10 (medical data) and organizations and persons. We are still in an initial state, but we already loaded 10 datasets (6 from DBpedia, 4 external) on the bus using these phases:
- Acquisition: data is downloaded from the source and logged in.
- Conversion: data is converted to N-Triples and cleaned (Syntax parsing, datatype validation, and SHACL).
- Mapping: the vocabulary is mapped on the DBpedia Ontology and converted (We have been doing this for Wikipedia’s Infoboxes and Wikidata, but now we do it for other datasets as well).
- Linking: Links are mainly collected from the sources, cleaned and enriched.
- IDying: All entities found are given a new Databus ID for tracking.
- Clustering: ID’s are merged onto clusters using one of the Databus ID’s as cluster representative.
- Data Comparison: Each dataset is compared with all other datasets. We have an algorithm that decides on the best value, but the main goal here is transparency, i.e. to see which data value was chosen and how it compares to the other sources.
- A main knowledge graph fused from all the sources, i.e. a transparent aggregate.
- For each source, we are producing a local fused version called the “Databus Complement”. This is a major feedback mechanism for all data providers, where they can see what data they are missing, what data differs in other sources and what links are available for their IDs.
- You can compare all data via a web service.
Contact us via firstname.lastname@example.org if you would like to have additional datasets integrated and maintained alongside DBpedia.
From your point of view
If you are selling data, the Databus provides numerous opportunities for you. You can link your offering to the open entities in the Databus. This allows consumers to discover your services better by showing it with each request.
Open data on the Databus will be a commodity. We are greatly downing the cost of understanding the data, retrieving and reformatting it. We are constantly extending ways of using the data and are willing to implement any formats and APIs you need. If you are lacking a certain kind of data, we can also scout for it and load it onto the Databus.
Is it free?
Maintaining the Databus is a lot of work and servers incurring a high cost. As a rule of thumb, we are providing everything for free that we can afford to provide for free. DBpedia was providing everything for free in the past, but this is not a healthy model, as we can neither maintain quality properly nor grow.
On the Databus everything is provided “As is” without any guarantees or warranty. Improvements can be done by the volunteer community. The DBpedia Association will provide a business interface to allow guarantees, major improvements, stable maintenance, and hosting.
Final databases are licensed under ODC-By. This covers our work on recomposition of data. Each fact is individually licensed, e.g. Wikipedia abstracts are CC-BY-SA, some are CC-BY-NC, some are copyrighted. This means that data is available for research, informational and educational purposes. We recommend to contact us for any professional use of the data (clearing) so we can guarantee that legal matters are handled correctly. Otherwise, professional use is at own risk.
The Databus data is available at http://downloads.dbpedia.org/databus/ ordered into three main folders:
- Data: the data that is loaded on the Databus at the moment
- Global: a folder that contains provenance data and the mappings to the new IDs
- Fusion: the output of the Databus
Most notably you can find:
- Provenance mapping of the new ids in global/persistence-core/cluster-iri-provenance-ntriples/<http://downloads.dbpedia.org/databus/global/persistence-core/cluster-iri... and global/persistence-core/global-ids-ntriples/<http://downloads.dbpedia.org/databus/global/persistence-core/global-ids-...
- The final fused version for the core: fusion/core/fused/<http://downloads.dbpedia.org/databus/fusion/core/fused/>
- A detailed JSON-LD file for data comparison: fusion/core/json/<http://downloads.dbpedia.org/databus/fusion/core/json/>
- Complements, i.e. the enriched Dutch DBpedia Version: fusion/core/nl.dbpedia.org/<http://downloads.dbpedia.org/databus/fusion/core/nl.dbpedia.org/>
(Note that the file and folder structure are still subject to change)
- Include more existing data from DBpedia
- Renew all DBpedia releases in a separate fashion:
- Load all data in the comparison tool: http://188.8.131.52:9000/?s=http%3A%2F%2Fid.dbpedia.org%2Fglobal%2F12HpzV&p=http%3A%2F%2Fdbpedia.org%2Fontology%2Farchitect&src=general
- Load all data into a SPARQL endpoint
- Create a simple open source software that let’s everybody push data on the Databus in an automated way
- build your own data inventory and merchandise your data via Linked Data or via secure named graphs in the DBpedia SPARQL Endpoint (WebID + TLS + OpenLink’s Virtuoso database)
- Offer your Linked Data tools, services, products
- Incubate new research into products
- Example: Support for RDFUnit (https://github.com/AKSW/RDFUnit created by the SHACL editor), assistance with SHACL writing and deployment of the open-source software
DBpedia and the Databus will transform Linked Data into a networked data economy
For any questions or inquiries related to the new DBpedia Databus, please contact us via email@example.com
The post The DBpedia Databus – transforming Linked Data into a networked data economy appeared first on DBpedia Blog.