Motivation: Naming patterns and name distribution from
one generation to the next are important criteria for determining, among
others, preservation of family- and self-identity and religious traditions.
In the present case, our tool extracts data relevant to the identification of
father-son naming patterns.
Implementation: This tool is based on a python script
that locates instances of phrases such as "X son of Y" in the texts, and
keeps note of the two names along with their family relationship.
The script's output is parsed by a Java Servlet, which extends the
DataSourceServlet class of the Google Visualizations API.
The client runs a javascript which contacts the server to retrieve this data,
parses it and displays it to the user.
Please note that the script does not yet read the lemmatization data of the
ATFs to decide on the ethnicity of each PN. This ethnicity is currently
inserted by hand for the demonstration.
Comments: We hope that the extraction of names and family
relationships will be replaced in the near future with a utility common to
all the Oracc-based projects.
Demonstration of the tool