Publisher: O'Reilly Media Released: January 2011 Pages: 360
Facebook, Twitter, and LinkedIn generate a tremendous amount of valuable social data, but how can you find out who's making connections with social media, what they’re talking about, or where they’re located? This concise and practical book shows you how to answer these questions and more. You'll learn how to combine social web data, analysis techniques, and visualization to help you find what you've been looking for in the social haystack, as well as useful information you didn't know existed. Each standalone chapter introduces techniques for mining data in different areas of the social Web, including blogs and email. All you need to get started is a programming background and a willingness to learn basic Python tools. - Get a straightforward synopsis of the social web landscape
- Use adaptable scripts on GitHub to harvest data from social network APIs such as Twitter, Facebook, and LinkedIn
- Learn how to employ easy-to-use Python tools to slice and dice the data you collect
- Explore social connections in microformats with the XHTML Friends Network
- Apply advanced mining techniques such as TF-IDF, cosine similarity, collocation analysis, document summarization, and clique detection
- Build interactive visualizations with web technologies based upon HTML5 and JavaScript toolkits
"Let Matthew Russell serve as your guide to working with social data sets old (email, blogs) and new (Twitter, LinkedIn, Facebook). Mining the Social Web is a natural successor to Programming Collective Intelligence: a practical, hands-on approach to hacking on data from the social Web with Python." --Jeff Hammerbacher, Chief Scientist, Cloudera "A rich, compact, useful, practical introduction to a galaxy of tools, techniques, and theories for exploring structured and unstructured data." --Alex Martelli, Senior Staff Engineer, Google |
-
Chapter 1 Introduction: Hacking on Twitter Data -
Installing Python Development Tools -
Collecting and Manipulating Twitter Data -
Closing Remarks -
Chapter 2 Microformats: Semantic Markup and Common Sense Collide -
XFN and Friends -
Exploring Social Connections with XFN -
Geocoordinates: A Common Thread for Just About Anything -
Slicing and Dicing Recipes (for the Health of It) -
Collecting Restaurant Reviews -
Summary -
Chapter 3 Mailboxes: Oldies but Goodies -
mbox: The Quick and Dirty on Unix Mailboxes -
mbox + CouchDB = Relaxed Email Analysis -
Threading Together Conversations -
Visualizing Mail “Events” with SIMILE Timeline -
Analyzing Your Own Mail Data -
Closing Remarks -
Chapter 4 Twitter: Friends, Followers, and Setwise Operations -
RESTful and OAuth-Cladded APIs -
A Lean, Mean Data-Collecting Machine -
Constructing Friendship Graphs -
Summary -
Chapter 5 Twitter: The Tweet, the Whole Tweet, and Nothing but the Tweet -
Pen : Sword :: Tweet : Machine Gun (?!?) -
Analyzing Tweets (One Entity at a Time) -
Juxtaposing Latent Social Networks (or #JustinBieber Versus #TeaParty) -
Visualizing Tons of Tweets -
Closing Remarks -
Chapter 6 LinkedIn: Clustering Your Professional Network for Fun (and Profit?) -
Motivation for Clustering -
Clustering Contacts by Job Title -
Fetching Extended Profile Information -
Geographically Clustering Your Network -
Closing Remarks -
Chapter 7 Google Buzz: TF-IDF, Cosine Similarity, and Collocations -
Buzz = Twitter + Blogs (???) -
Data Hacking with NLTK -
Text Mining Fundamentals -
Finding Similar Documents -
Buzzing on Bigrams -
Tapping into Your Gmail -
Before You Go Off and Try to Build a Search Engine… -
Closing Remarks -
Chapter 8 Blogs et al.: Natural Language Processing (and Beyond) -
NLP: A Pareto-Like Introduction -
A Typical NLP Pipeline with NLTK -
Sentence Detection in Blogs with NLTK -
Summarizing Documents -
Entity-Centric Analysis: A Deeper Understanding of the Data -
Closing Remarks -
Chapter 9 Facebook: The All-in-One Wonder -
Tapping into Your Social Network Data -
Visualizing Facebook Data -
Closing Remarks -
Chapter 10 The Semantic Web: A Cocktail Discussion -
An Evolutionary Revolution? -
Man Cannot Live on Facts Alone -
Hope -
Colophon |
- Title:
- Mining the Social Web
- By:
- Matthew A. Russell
- Publisher:
- O'Reilly Media
- Formats:
-
- Print
- Ebook
- Safari Books Online
- Print:
- February 2011
- Ebook:
- January 2011
- Pages:
- 360
- Print ISBN:
- 978-1-4493-8834-8
- | ISBN 10:
- 1-4493-8834-5
- Ebook ISBN:
- 978-1-4493-8835-5
- | ISBN 10:
- 1-4493-8835-3
|
Colophon The animal on the cover of Mining the Social Web is a groundhog (Marmota monax), also known as a woodchuck (a name derived from the Algonquin name wuchak). Groundhogs are famously associated with the US/Canadian holiday Groundhog Day, held every February 2nd. Folklore holds that if the groundhog emerges from its burrow that day and sees its shadow, winter will continue for six more weeks. Proponents say that the rodents forecast accurately 75 to 90 percent of the time. Many cities host famous groundhog weather prognosticators, including Punxsutawney Phil (of Punxsutawney, PA and the 1993 Bill Murray film). This legend perhaps originates from the fact that the groundhog is one of the few species that enters true hibernation during the winter. Primarily herbivorous, groundhogs will fatten up in the summer on vegetation, berries, nuts, insects, and the crops in human gardens, causing many to consider them pests. They then dig a winter burrow, and remain there from October to March (although they may emerge earlier in temperate areas, or, presumably, if they will be the center of attention on their eponymous holiday). The groundhog is the largest member of the squirrel family, around 16–26 inches long and weighing 4–9 pounds. They are equipped with curved, thick claws ideal for digging, and two coats of fur: a dense grey undercoat and a lighter colored topcoat of longer hairs, which provides protection against the elements. Groundhogs range throughout most of Canada and northern regions of the United States, in places where open space and woodlands meet. They are capable of climbing trees and swimming, but are usually found on the ground, not far from the burrows they dig for sleeping, rearing their young, and protection from predators. These burrows typically have two to five entrances, and up to 46 feet of tunnels. |
|
Description
|
Table of Contents
|
Product Details
|
About the Author
|
Colophon
|
 |
|
 |
|
|
|
Recommended for You
|
Recently Viewed
|
 |
|
By Matthew McCool
May 2007
By Rickford Grant, Rickford Grant
June 2007
By Joan M. Johnson
May 2007
|
Customer Reviews
7/7/2011 (1 of 1 customers found this review helpful) 4.0An excellent intro to mining social data By humedini from United Kingdom - Accurate
- Easy to understand
- Helpful examples
- Well-written
6/19/2011 4.0Nice entry point for text minining - Accurate
- Easy to understand
- Helpful examples
3/13/2011 (1 of 1 customers found this review helpful) 5.0Easy to read. I tore through it By wiebedj from Vancouver, BC - Easy to understand
- Helpful examples
3/8/2011 (2 of 3 customers found this review helpful) 3.0Interesting but lacking in practical exa By iamdavebowers from Boston 1/31/2011 (4 of 4 customers found this review helpful) 4.0A primer, but not a panacea. By Honest Isaac from San Antonio, TX About Me Designer, Developer, Sys Admin - Accurate
- Concise
- Straightforward examples
- Austere
- Relies on external docs
|
|
|
Merchant response: Thank you for this thoughtful review. If you're looking for a book that is more focused on examples, check out 21 Recipes for Mining Twitter (http://oreilly.com/catalog/9781449303167/).