The AI Works community logo The Blockchain Works community logo The Functional Works community logo The Golang Works community logo The Java Works community logo The JavaScript Works community logo The Python Works community logo The Remote Works community logo The WorksHub company logo

We use cookies and other tracking technologies to improve your browsing experience on our site, analyze site traffic, and understand where our audience is coming from. To find out more, please read our privacy policy.

By choosing 'I Accept', you consent to our use of cookies and other tracking technologies.

We use cookies and other tracking technologies to improve your browsing experience on our site, analyze site traffic, and understand where our audience is coming from. To find out more, please read our privacy policy.

By choosing 'I Accept', you consent to our use of cookies and other tracking technologies. Less

We use cookies and other tracking technologies... More

Login or register
to publish this job!

Login or register
to save this job!

Login or register
to save interesting jobs!

Login or register
to get access to all your job applications!

Login or register to start contributing with an article!

Login or register
to see more jobs from this company!

Login or register
to boost this post!

Show some love to the author of this blog by giving their post some rocket fuel 🚀.

Login or register to search for your ideal job!

Login or register to start working on this issue!

Login or register
to save articles!

Login to see the application

Engineers who find a new job through AI Works average a 15% increase in salary 🚀

You will be redirected back to this page right after signin

List of additional Joycean compounds

Work started
Pull requests: 0
Contributors: 21
Level: Intermediate
  • HTML
Work started
Pull requests: 0
Contributors: 21
Level: Intermediate
  • HTML

On GitHub

James Joyce's novel Ulysses in TEI XML. Work-in-progress.
More info >

Issue posted by: 
droher's avatar

David Roher

Description

I wrote a hacky algorithm to find likely Joycean compounds. It excludes any words already tagged as compounds in the XML, as well as any words inside of a foreign language tag. There are plenty of false positives, but it does a pretty good job at sending likely ones to the top of the list:
compound_guesses.txt

I'd be happy to put up a PR to add a bunch of these to the XML, but I wanted to check before I did to see if you'd be interested/if that was the best way to go about it. Thanks!

    Use Open Source to hire or get hired

    On GitHub

    James Joyce's novel Ulysses in TEI XML. Work-in-progress.
    More info >

    Issue posted by: 
    droher's avatar

    David Roher

    Use Open Source to hire or get hired

    List of additional Joycean compounds
    View on GitHub