head-meeting

Groovy SIG

Making Tests and DAOs more Groovy

October 23, 2012 Until we get Java 8 and Lambdas, Java programming is stuck with the scourge of anonymous inner classes for providing support to mocking objects and stubbing implementations for unit tests. Additionally we are handcuffed by rigid schemas

AJUG Meetup

Large-scale Entity Extraction and Probabilistic Record Linkage

Tuesday, August 19, 2014

Large-scale entity extraction, disambiguation and linkage in Big Data can challenge the traditional methodologies developed over the last three decades. Entity linkage, in particular, is cornerstone for a wide spectrum of applications, such as Master Data Management, Data Warehousing, Social Graph Analytics, Fraud Detection and Identity Management. Traditional rules based heuristic methods usually don’t scale properly, are language specific and require significant maintenance over time.

We will introduce the audience to the use of probabilistic record linkage, also known as specificity based linkage, on Big Data, to perform language independent large-scale entity extraction, resolution and linkage across diverse sources. We will also present a live demonstration reviewing the different steps required during the data integration process (ingestion, profiling, parsing, cleansing, standardization and normalization), and show the basic concepts behind probabilistic record linkage on a real-world application.

Location:


Holiday Inn Atlanta-Perimeter/Dunwoody

4386 Chamblee Dunwoody Road,
Atlanta, GA (map)

AJUG Tweets

Follow @atlantajug on twitter.