7: Further Reading
Tools to Consider:
Wordle is a simple text analysis tool that Julie Meloni calls a “gateway drug” to text analysis by creating word clouds of text.
Google Ngrams allows for text searches of phrases up to five words across a corpus of 5 million digitized books.
Voyeur and Tapor both offer browser interfaces for performing basic text analysis.
MALLET is more advanced program for completing, among others, topic modeling.
Stanford’s Natural Language Processing Group has developed a range of advanced computational linguistics software.
R is an open-source statistical package that is particularly useful for managing large datasets.
Further Reading:
Gregory Crane, “What Do You Do With a Million Books?” D-Lib Magazine. March 2006. Volume 12 Number 3.http://www.dlib.org/dlib/march06/crane/03crane.html
Tanya Clement et. al., “How Not to Read a Million Books.” Online: http://www3.isrl.illinois.edu/~unsworth/hownot2read.html
Text Analysis Developers Alliance, “What is Text Analysis?”
http://tada.mcmaster.ca/Main/WhatTA
Aditi Muralidharan, “Extracting Social Networks From 19th Century Novels.” Online: http://mininghumanities.com/2010/09/13/social-networks-19th-century/
Geoffrey Nunberg, “Google’s Book Search: A Disaster for Scholars.” The Chronicle Review, 31 August 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
Sharon Block, “Doing More With Digitization,” Common-Place Blog: http://www.common-place.org/vol-06/no-02/tales/
Franco Moretti, Graphs, Maps, Trees: Abstract Models for a Literary History (New York: Verso, 2005).
Franco Moretti, “Conjectures on World Literature,” New Left Review (2000): 54-68.