Python Projects

From dandi08

Jump to: navigation, search

These projects should consist of a substantial Python program that does something related to linguistics, Web 2.0, or the “deep” web.

  • Code-Breaker will be an attempt to try to make a program which can crack texts encoded in simple ciphers without the required decryption key. NLTK will be involved, and maybe Wordnet.
  • Poetry Generator is a program, written in Python, that algorithmically creates poetry. There are three basic levels that could be implemented.
  • HTML parser with NLTK A program that strips web pages found in search queries and reorganizes the useful information on a webpage that is uploaded to a server.
  • A Machine Learning study would involve research, math (omg no!), and perhaps implementation in python.
  • A Morpheme Tree Generator designed in Python to output a tree diagram of the morphemes of words.
  • X-TREME CROSSWORD GENERATION: 2 THA MAXX: a crossword generator written in Python.
  • Yoda Chatbot -- program a 3000-year-old Jedi master, you could
  • Song Generator The guitar hero of song writing, a way for lazy and uncreative people to make music! This could be a simple or complicated as the group is interested in pursuing.
  • Spell Check -- A program to open a text file and perform a spell check.
  • Machine Translation -- A program to parse vocabulary and create a probable English translation of a Latin sentence.
  • Reverse Dictionary -- A Python program that allows a user to look up words by meaning, part of speech, or sequences of characters using NLTK and Wordnet.
  • The Language of Emotion is a project that will combine Python, NLTK, and a bit of voyeurism.
  • Haiku Generation Probabilistic/ Poetry generation/ Styled after art
  • Ontological Text Parser A program that combines language parsing and ontologies with simple heuristics to simulate human cognition.
  • Verb Conjugator -- A python program to conjugate basic English verbs.
  • IRC Bot "Doctorow Expert" -- A python bot written to answer questions about the works of Cory Doctorow. It would translate human language questions into nltk searches. Doctorow was chosen because of his pro-remix copyright stance.
  • Language Identifier -- A linguistics-based project that uses Python to read a text file and identify the language in which it's written.