These are wonderful, easy to use tools to explore whether you want to learn more about text analysis. Voyant is a free web browser and AntConc is a free application available to download.
Supported by Indiana University and University of Illinois at Urbana-Champaign, HathiTrust Research Center (HTRC) enables computational analysis of works in the HathiTrust Digital Library (HTDL) to facilitate non-profit research and educational use of the collection.
As institutional members of HathiTrust, Columbia faculty and students have access to the HathiTrust Research Center. To access this portal, you must first Login to HathiTrust using your USC network id. Once signed in, you can read about the HathiTrust Research Center here. They have freely available text analytics algorithms that you can use on their data by signing up for a Research Center Analytics account. These include extracted feature sets, which include metadata from the many volumes in HathiTrust, Topic Modeling, Named Entity Recognizer,and Token Count or Word Clouds. Also, try the BookWorm .
This wiki page will help you Get Started with the HathiTrust Research Center.
These tools will take some learning, but there are people in Research Computing and in the Libraries who can help you get started.