On Wednesday I will attend the SMILA (Semantic Information Logistics Architecture) conference in Berlin.
SMILA is a Java framework that lets you access and search unstructured information.
Example: the WebCrawlerPipelet extracts content from a website, and the LuceneIndexPipelet stores that content in a specified Lucene index. You can then access and search the index right away via HTTP requests (have a look at the “5 Minutes to Success” article).
As the name “pipelet” implies, you can assemble various pipelets into one pipeline (using BPEL). For me that’s the key feature of SMILA: assembling different services to enrich your content. That’s what we are currently exploring SMILA for in veeeb.
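To make the pipeline idea a bit more concrete, here is a minimal sketch in plain Java. It is not SMILA's actual API and it does not use BPEL: the `Pipelet` interface, the `Pipeline` class and the two example steps are hypothetical names made up for illustration. It only shows the general pattern of passing a record through a chain of small processing steps, each of which enriches it.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical interface for illustration; this is not SMILA's real pipelet API.
interface Pipelet {
    Map<String, String> process(Map<String, String> record);
}

// Runs the pipelets in order, handing each one the output of the previous one.
final class Pipeline {
    private final List<Pipelet> steps;

    Pipeline(List<Pipelet> steps) {
        this.steps = List.copyOf(steps);
    }

    Map<String, String> run(Map<String, String> record) {
        Map<String, String> current = new LinkedHashMap<>(record);
        for (Pipelet step : steps) {
            current = step.process(current);
        }
        return current;
    }
}

class PipelineDemo {
    // Builds a two-step pipeline: strip markup, then add a word count.
    static Pipeline build() {
        Pipelet stripTags = record -> {
            Map<String, String> out = new LinkedHashMap<>(record);
            String html = record.getOrDefault("html", "");
            out.put("text", html.replaceAll("<[^>]*>", "").trim());
            return out;
        };
        Pipelet countWords = record -> {
            Map<String, String> out = new LinkedHashMap<>(record);
            String text = record.getOrDefault("text", "");
            int words = text.isEmpty() ? 0 : text.split("\\s+").length;
            out.put("wordCount", Integer.toString(words));
            return out;
        };
        return new Pipeline(List.of(stripTags, countWords));
    }

    public static void main(String[] args) {
        Map<String, String> input = Map.of("html", "<p>Heidi Klum is a model</p>");
        System.out.println(build().run(input));
    }
}
```

Each step returns a new map instead of changing its input, so steps can be reordered or swapped without affecting each other. In SMILA itself the orchestration is described in BPEL rather than in a Java list like this.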
As you may know, veeeb currently gets information about extracted topics (or entities) from freebase.com; for example, we get Freebase information about the person Heidi Klum. Freebase has a lot of topics and a lot of information about them, but it’s all in English and it’s only one repository.
Via SMILA we will be able to aggregate information from other sources such as Wikipedia, and even from custom repositories (for example, about people in your company), and provide the author with this information. The new service we are developing will be useful not only in veeeb but also in other applications.
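As a rough sketch of what aggregating over several repositories could look like, the following plain Java example merges the facts that different sources return for one topic. Everything in it is an assumption for illustration: `TopicSource`, `InMemorySource` and `TopicAggregator` are hypothetical names, the sources are in-memory stand-ins rather than real Freebase or Wikipedia clients, and the values are placeholders. It is not how the veeeb service is implemented.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical abstraction over one repository (Freebase, Wikipedia, a company directory, ...).
interface TopicSource {
    String name();
    Map<String, String> lookup(String topic);
}

// In-memory stand-in for a real repository client; the data is supplied by the caller.
final class InMemorySource implements TopicSource {
    private final String name;
    private final Map<String, Map<String, String>> data;

    InMemorySource(String name, Map<String, Map<String, String>> data) {
        this.name = name;
        this.data = data;
    }

    @Override
    public String name() {
        return name;
    }

    @Override
    public Map<String, String> lookup(String topic) {
        // Unknown topics yield an empty result instead of null.
        return data.getOrDefault(topic, Map.of());
    }
}

// Asks every source about a topic and merges the answers into one map.
final class TopicAggregator {
    private final List<TopicSource> sources;

    TopicAggregator(List<TopicSource> sources) {
        this.sources = List.copyOf(sources);
    }

    Map<String, String> aggregate(String topic) {
        Map<String, String> merged = new LinkedHashMap<>();
        for (TopicSource source : sources) {
            for (Map.Entry<String, String> fact : source.lookup(topic).entrySet()) {
                // Prefix each key with the source name so no source overwrites another.
                merged.put(source.name() + "." + fact.getKey(), fact.getValue());
            }
        }
        return merged;
    }
}

class AggregatorDemo {
    // Placeholder data only; real sources would query the actual repositories.
    static TopicAggregator build() {
        TopicSource freebase = new InMemorySource("freebase",
                Map.of("Heidi Klum", Map.of("type", "person")));
        TopicSource wikipedia = new InMemorySource("wikipedia",
                Map.of("Heidi Klum", Map.of("summary", "placeholder summary text")));
        TopicSource company = new InMemorySource("company",
                Map.of("Jane Doe", Map.of("department", "placeholder department")));
        return new TopicAggregator(List.of(freebase, wikipedia, company));
    }

    public static void main(String[] args) {
        System.out.println(build().aggregate("Heidi Klum"));
    }
}
```

Prefixing every key with the source name keeps the origin of each fact visible to the author. A real service would also need to decide how to handle conflicting or duplicate facts, which this sketch leaves out.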
I am looking forward to the conference to see how others are using SMILA.
Have a look at SMILA.
Christoph