Join AJUG

AJUG Member Community

We provide an open access list for AJUG and Java-related topics.

  • You must subscribe in order to post.
  • Suitable content includes ONLY Java technical questions. Please respect your peers and use common sense when you post.

To subscribe, send an email message to ajug-members-subscribe@ajug.org with “subscribe” in the subject.

To unsubscribe, send an email message to ajug-members-unsubscribe@ajug.org with “unsubscribe” in the subject. You must unsubscribe from the same email address under which you subscribed. So if you are changing providers or jobs, remember to unsubscribe before you make the move.

Posting a Message

To post a message, simply send it to ajug-members@www.ajug.org.

 

AJUG Meetup

Large-scale Entity Extraction and Probabilistic Record Linkage

Tuesday, August 19, 2014

Large-scale entity extraction, disambiguation and linkage in Big Data can challenge the traditional methodologies developed over the last three decades. Entity linkage, in particular, is cornerstone for a wide spectrum of applications, such as Master Data Management, Data Warehousing, Social Graph Analytics, Fraud Detection and Identity Management. Traditional rules based heuristic methods usually don’t scale properly, are language specific and require significant maintenance over time.

We will introduce the audience to the use of probabilistic record linkage, also known as specificity based linkage, on Big Data, to perform language independent large-scale entity extraction, resolution and linkage across diverse sources. We will also present a live demonstration reviewing the different steps required during the data integration process (ingestion, profiling, parsing, cleansing, standardization and normalization), and show the basic concepts behind probabilistic record linkage on a real-world application.

Location:


Holiday Inn Atlanta-Perimeter/Dunwoody

4386 Chamblee Dunwoody Road,
Atlanta, GA (map)

AJUG Tweets

Follow @atlantajug on twitter.