In information extraction frameworks, finding a matching for a word in a text is a very common issue. The matching is often done based in a given input dictionnary. This task is called Named Entity Recognition (NER). This task is useful to classify the words in the text. For instance, we could have a dictionnary…
Read MoreWe have been working on different initiatives to process large amounts of open data and to produce useful information, for instance, the Educational Data Lab or the Web Portal for Educational Resources. One recurrent difficulty is the initial data analysis and transformations, where it is necessary to understand the data before loading it into some…
Read MoreOur group has been working on different initiatives to produce useful open data to end users. One of them is the collection and monitoring for assessment and evaluation, Another one focuses on the integration and search of OER (Open Educational Data). In this post, I will focus on a third initiative, where we extract existing…
Read MoreOpen Educational Resources (OER) are often defined as freely available material, in some kind of media, to support teachers and students on the task of learning some subject. These resources are valuable material that any teacher could use in its teaching activities. However, despite having many existing available repositories hosting these kind of resources, it…
Read MoreInteroperability been data formats/metadata/applications has always been a research subject that I am interested on. Without searching very hard, I’ve found 4 of my publications with interoperability in the title! There are so many solutions covering different aspects, which makes it very hard to choose amongst the best data-format-application-framework-query-language-etc. We defined, with a MsC studeng…
Read MoreWe started to use MonetDB, a column store database in a couple of projects involving querying over our DataWarehouses, created from public open data. The goal of one of the projects, called Educational Data Lab, is to return information about educational indicators in Brazil, with its historical evolution. We have nowadays about 100Gb of data…
Read MoreDeveloping Open Data solutions is often challenging for different reasons. The first challenge, once the application is defined, is to get the correct data source and to process it in some useful way. There are several Open Data governmental initiatives worldwide, such as the ones from Brazil, USA or UK (to name only 3). Two…
Read More