« A2Z Midterm - Blog Generated Exquisite Corpse | Main | Streets »

A2Z Midterm - Generative text

For this midterm assignment I wanted to work and play with generative text, grabbing text form personal blogs in the web. Using the logic of the Exquisite Corpse, the program, based on a word chosen by the user, starts to find it into the first text file and extracts the word and the 11 words that follows. Then, the last word extract became the keyword that the program will look in the following text and the process starts to repeat itself till it doesn't find more phrases.

To make the process easier, I started by copying text from 5 blogs and pasting into separate txt files, simulating different url's and what the crawler could get.
The next stop will be creating the crawler to look on the web for personal blogs.

The result needs to be tuned a little more but once I get the crawler working, I can see better the results and improve the program.

here are some examples of the output:

generate2.png

generate3.png

generate4.png

see source code

One idea of where this could go is to create a fake blog, writing a program that everyday crawl personal blogs, search for posts with the same date and using this set rules, creates a "fake" post or a mash/up post.

It was an interesting exercise to get my hands in regular expressions and feel more comfortable with programming in java

TrackBack

TrackBack URL for this entry:
http://www.prntscreen.net/cgi-bin/mt/mt-tb.cgi/376

Post a comment

(If you haven't left a comment here before, you may need to be approved by the site owner before your comment will appear. Until then, it won't appear on the entry. Thanks for waiting.)

About

This page contains a single entry from the blog posted on March 4, 2008 1:52 PM.

The previous post in this blog was A2Z Midterm - Blog Generated Exquisite Corpse.

The next post in this blog is Streets.

Many more can be found on the main index page or by looking through the archives.

Powered by
Movable Type 3.35