7: Further Reading
Tools to Consider:
Google Ngrams allows for text searches of phrases up to five words across a corpus of 5 million digitized books.
MALLET is more advanced program for completing, among others, topic modeling.
Stanford’s Natural Language Processing Group has developed a range of advanced computational linguistics software.
R is an open-source statistical package that is particularly useful for managing large datasets.
Gregory Crane, “What Do You Do With a Million Books?” D-Lib Magazine. March 2006. Volume 12 Number 3.http://www.dlib.org/dlib/march06/crane/03crane.html
Tanya Clement et. al., “How Not to Read a Million Books.” Online: http://www3.isrl.illinois.edu/~unsworth/hownot2read.html
Text Analysis Developers Alliance, “What is Text Analysis?”
Aditi Muralidharan, “Extracting Social Networks From 19th Century Novels.” Online: http://mininghumanities.com/2010/09/13/social-networks-19th-century/
Geoffrey Nunberg, “Google’s Book Search: A Disaster for Scholars.” The Chronicle Review, 31 August 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
Sharon Block, “Doing More With Digitization,” Common-Place Blog: http://www.common-place.org/vol-06/no-02/tales/
Franco Moretti, Graphs, Maps, Trees: Abstract Models for a Literary History (New York: Verso, 2005).
Franco Moretti, “Conjectures on World Literature,” New Left Review (2000): 54-68.