Text Mining – THATCamp Lehigh Valley 2013 http://lehigh2013.thatcamp.org The Humanities and Technology Camp Wed, 06 Mar 2013 16:38:35 +0000 en-US hourly 1 https://wordpress.org/?v=4.9.12 [Talk/Make] The Untapped Power of Digital Readers: Why are we still reading the same way? http://lehigh2013.thatcamp.org/02/26/talkmake-the-untapped-power-of-digital-readers-why-are-we-still-reading-the-same-way/ http://lehigh2013.thatcamp.org/02/26/talkmake-the-untapped-power-of-digital-readers-why-are-we-still-reading-the-same-way/#comments Tue, 26 Feb 2013 21:58:10 +0000 http://lehigh2013.thatcamp.org/?p=294 Continue reading ]]>

In the near future (or maybe even now?), people will do their reading almost exclusively via device (i.e. on a digital reader, laptop, tablet, and/or smartphone) instead of  via paper.  Given this change of medium, some questions naturally follow:  Should digital readers be used like traditional books (i.e. just to display text)?  Or can technology be integrated into the reading experience without becoming distracting?

I propose a session in which we investigate the latter question.  I believe we can create integrated text analysis and mining tools that will not only improve students’ reading comprehension but also aid anyone’s literary scholarship.  I will start my session by introducing two techniques—sentiment analysis and social network extraction—that work toward this goal, and then the session will open-up into a discussion of what kinds of computational text analysis and visualization tools could be intergraded into literary analysis and the digital reading experience and how useful these tools would be.  If there is interest, we could even try to hack our own text analysis / reading aide tool.  Below are some examples of the capabilities of automated sentiment analysis and social network extraction.

Sentiment Analysis:

  • Visualization of sentiment in the Bible: shows the old testament to be generally negative and the new testament to be generally positive.
  • The picture below shows the results of sentiment analysis on Shakespeare’s Othello. Othello’s sentiment (how positive/negative he feels) towards Desdemona is algorithmically tracked over the course of the play and is represented by the black line, which we see drastically declines over time. Screen Shot 2013-02-26 at 4.31.50 PM

Social Network Extraction:

  • A social network algorithmically extracted from Hamlet: Screen Shot 2013-02-26 at 4.37.21 PM

 

]]>
http://lehigh2013.thatcamp.org/02/26/talkmake-the-untapped-power-of-digital-readers-why-are-we-still-reading-the-same-way/feed/ 1