Data Crunching by Greg Wilson


$10.00
Condition - Like New
Only 1 left

Summary

Programmers always need to move data around, translate it from one format to another, or extract useful information from it. This book shows the best approaches to crunching data in several programming languages; it covers all widely used data formats, including plain text, XML, relational, and binary data.

Data Crunching Summary

Data Crunching: Solve Everyday Problems Using Java, Python and More by Greg Wilson

Every day, all around the world, programmers have to recycle legacy data, translate from one vendor's proprietary format into another's, check that configuration files are internally consistent, and search through web logs to see how many people have downloaded the latest release of their product. This kind of data crunching may not be glamorous, but knowing how to do it efficiently is essential to being a good programmer. This book describes the most useful data crunching techniques, explains when you should use them, and shows how they will make your life easier. Along the way, it will introduce you to some handy, but under-used, features of Java, Python, and other languages. It will also show you how to test data crunching programs, and how data crunching fits into the larger software development picture.

About Greg Wilson

Greg Wilson holds a Ph.D. in Computer Science from the University of Edinburgh, and has worked on high-performance scientific computing, data visualization, and computer security. He is the author of Practical Parallel Programming (MIT Press, 1995), a contributing editor at Dr. Dobb's Journal, and an adjunct professor in Computer Science at the University of Toronto.

Table of Contents

1 Introduction
1.1 Name That Molecule
1.2 There's One in Every Crowd
1.3 And the Moral Is
1.4 Questions About Data Crunching
1.5 Road Map
2 Text
2.1 Reversing a File
2.2 Reformatting Data
2.3 Handling Multiline Records
2.4 Checking for Collisions
2.5 Including One File in Another
2.6 The Unix Shell
2.7 Very Large Data Sets
2.8 Summary
3 Regular Expressions
3.1 The Shell
3.2 Basic Patterns
3.3 Extracting Matched Values
3.4 Practical Applications
3.5 Speaking in Tongues
3.6 Other Systems
3.7 Summary
4 XML
4.1 A Quick Introduction
4.2 SAX
4.3 DOM
4.4 XPath
4.5 XSLT
4.6 Summary
5 Binary Data
5.1 Numbers
5.2 Input and Output
5.3 Strings
5.4 Summary
6 Relational Databases
6.1 Simple Queries
6.2 Nesting and Negation
6.3 Aggregation and Views
6.4 Creating, Updating, and Deleting
6.5 Using SQL in Programs
6.6 Summary
7 Horseshoe Nails
7.1 Unit Testing
7.2 Encoding and Decoding
7.3 Floating-Point Arithmetic
7.4 Dates and Times
7.5 Summary
A Resources
A.1 Bibliography

Additional information

GOR013562730
ISBN-13: 9780974514079
ISBN-10: 0974514071
Title: Data Crunching: Solve Everyday Problems Using Java, Python and More by Greg Wilson
Condition: Used - Like New
Format: Paperback
Publisher: The Pragmatic Programmers
Publication date: 2005-05-10
Pages: 193
N/A
Book picture is for illustrative purposes only; actual binding, cover, or edition may vary.
The book has been read, but looks new. The book cover has no visible wear, and the dust jacket is included if applicable. No missing or damaged pages, no tears, possible very minimal creasing, no underlining or highlighting of text, and no writing in the margins.
