Python Projects
From dandi08
These projects should consist of a substantial Python program that does something related to linguistics, Web 2.0, or the “deep” web.
- Code-Breaker is an attempt to build a program that can crack texts encoded in simple ciphers without the decryption key. NLTK will be involved, and maybe WordNet.
- Poetry Generator is a program, written in Python, that algorithmically creates poetry. There are three basic levels that could be implemented.
- HTML parser with NLTK -- A program that strips web pages found in search queries and reorganizes the useful information on a webpage that is uploaded to a server.
- A Machine Learning study would involve research, math (omg no!), and perhaps implementation in Python.
- A Morpheme Tree Generator designed in Python to output a tree diagram of the morphemes of words.
- X-TREME CROSSWORD GENERATION: 2 THA MAXX: a crossword generator written in Python.
- Yoda Chatbot -- program a 3000-year-old Jedi master, you could
- Song Generator -- The Guitar Hero of songwriting, a way for lazy and uncreative people to make music! This could be as simple or complicated as the group is interested in pursuing.
- Spell Check -- A program to open a text file and perform a spell check.
- Machine Translation -- A program to parse vocabulary and create a probable English translation of a Latin sentence.
- Reverse Dictionary -- A Python program that allows a user to look up words by meaning, part of speech, or sequences of characters using NLTK and WordNet.
- The Language of Emotion is a project that will combine Python, NLTK, and a bit of voyeurism.
- Haiku Generation -- Probabilistic / Poetry generation / Styled after art
- Ontological Text Parser -- A program that combines language parsing and ontologies with simple heuristics to simulate human cognition.
- Verb Conjugator -- A Python program to conjugate basic English verbs.
- IRC Bot "Doctorow Expert" -- A Python bot written to answer questions about the works of Cory Doctorow. It would translate natural-language questions into NLTK searches. Doctorow was chosen because of his pro-remix copyright stance.
- Language Identifier -- A linguistics-based project that uses Python to read a text file and identify the language in which it's written.
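
As a rough illustration of the kind of program the Code-Breaker entry describes, here is a minimal sketch that cracks a Caesar (shift) cipher by trying all 26 shifts and keeping the one that yields the most common English words. This is not the project's actual code. The function names and the tiny `COMMON_WORDS` list are assumptions made for the example, and a real project would probably score candidates against an NLTK corpus instead.

```python
import string

# Hypothetical mini word list, used only for this sketch.
COMMON_WORDS = {
    "the", "and", "of", "to", "in", "is", "that", "it",
    "was", "for", "on", "with", "as", "be", "at",
}


def shift_text(text, shift):
    """Move each ASCII letter back by `shift` places; leave other characters alone."""
    out = []
    for ch in text:
        if ch.isascii() and ch.isalpha():
            base = ord("a") if ch.islower() else ord("A")
            out.append(chr((ord(ch) - base - shift) % 26 + base))
        else:
            out.append(ch)
    return "".join(out)


def score(text):
    """Count how many whitespace-separated words appear in COMMON_WORDS."""
    words = [w.strip(string.punctuation).lower() for w in text.split()]
    return sum(1 for w in words if w in COMMON_WORDS)


def crack_caesar(ciphertext):
    """Try every shift and return (best_shift, decoded_text)."""
    best = max(range(26), key=lambda s: score(shift_text(ciphertext, s)))
    return best, shift_text(ciphertext, best)
```

To try it, encode a sentence with a negative shift, for example `shift_text("the cat sat on the mat", -3)`, and pass the result to `crack_caesar`. Very short texts, or texts with none of the listed words, may not decode correctly, since the score depends entirely on the word list.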