Home Page Icon
Home Page
Table of Contents for
Cover
Close
Cover
by Jens Albrecht, Sidharth Ramachandran, Christian Wink
Blueprints for Text Analytics Using Python
1. Gaining Early Insights from Textual Data
Exploratory Data Analysis
Introducing the Dataset
Blueprint: Building a Simple Text Preprocessing Pipeline
Blueprints for Word Frequency Analysis
Blueprint: Finding a Keyword in Context (KWIC)
Blueprint: Analyzing N-Grams
Blueprint: Comparing Frequencies across Time-Intervals and Categories
Closing Remarks
2. Scraping Websites and Extracting Data
What You’ll Learn and What We Will Build
Scraping and Data Extraction
Introducing the Reuters News Archive
URL Generation
Downloading Data
Extracting Semi-structured Data
Blueprint: Spidering
Density-based Text Extraction
All-in-one Approach
Possible Problems with Scraping
Closing Remarks and Recommendation
3. How to use text classification algorithms to identify and classify text into multiple categories
Introducing the Java Development Tools Bug Dataset
Blueprint: Building a Text Classification system
Final Blueprint for Text Classification
Cross-Validation
Hyperparameter Tuning with Grid Search
Blueprint recap and conclusion
Closing Remarks
Further Reading
Search in book...
Toggle Font Controls
Playlists
Add To
Create new playlist
Name your new playlist
Playlist description (optional)
Cancel
Create playlist
Sign In
Email address
Password
Forgot Password?
Create account
Login
or
Continue with Facebook
Continue with Google
Sign Up
Full Name
Email address
Confirm Email Address
Password
Login
Create account
or
Continue with Facebook
Continue with Google
Next
Next Chapter
Blueprints for Text Analysis Using Python
Add Highlight
No Comment
..................Content has been hidden....................
You can't read the all page of ebook, please click
here
login for view all page.
Day Mode
Cloud Mode
Night Mode
Reset