Randyzwitch - randyzwitch.com - randyzwitch.com

Latest News:

RSiteCatalyst Version 1.1 Release Notes 25 Aug 2013 | 10:54 pm

RSiteCatalyst version 1.1 is now available on CRAN. Changes from version 1 include: Support for Correlations/Subrelations in the QueueRanked function Support for Current Data in all ‘Queue‘ function...

Getting Started Using Hadoop, Part 4: Creating Tables With Hive 23 Aug 2013 | 03:03 am

In the previous three tutorials (1 , 2, 3), we’ve covered the background of Hadoop, how to build a proof-of-concept Hadoop cluster using Amazon EC2 and how to upload a .zip file to the cluster using H...

Anomaly Detection Using The Adobe Analytics API 15 Aug 2013 | 07:57 pm

As digital marketers & analysts, we’re often asked to quantify when a metric goes beyond just random variation and becomes an actual “unexpected” result. In cases such as A/B..N testing, it’s easy to ...

Tabular Data I/O in Julia 6 Aug 2013 | 07:05 pm

Importing tabular data into Julia can be done in (at least) three ways: reading a delimited file into an array, reading a delimited file into a DataFrame and accessing databases using ODBC. Reading a...

Amazon Elastic MapReduce with Python 31 Jul 2013 | 09:34 pm

In a previous rant about data science & innovation, I made reference to a problem I’m having at work where I wanted to classify roughly a quarter-billion URLs by predicted website content (without hav...

A Beginner’s Look at Julia 23 Jul 2013 | 09:16 pm

Over the past month or so, I’ve been playing with a new scientific programming language called ‘Julia‘, which aims to be a high-level language with performance approaching that of C. With that goal in...

Getting Started Using Hadoop, Part 3: Loading Data 22 May 2013 | 08:39 pm

In part 2 of the “Getting Started Using Hadoop” series, I discussed how to build a Hadoop cluster on Amazon EC2 using Cloudera CDH. This post will cover how to get your data into the Hadoop Distribute...

Innovation Will Never Be At The Push Of A Button 17 May 2013 | 07:28 pm

@randyzwitch @benjamingaines @usujason I am envisioning the data science equivalent of an autonomous vehicle pileup. — Todd Belcher (@toddmetrics) May 16, 2013   Recently, I’ve been getting my blood p...

Getting Started Using Hadoop, Part 2: Building a Cluster 26 Apr 2013 | 02:33 am

In Part 1 of this series, I discussed some of the basic concepts around Hadoop, specifically when it’s appropriate to use Hadoop to solve your data engineering problems and the terminology of the Hado...

Getting Started Using Hadoop, Part 1: Intro 18 Apr 2013 | 09:47 pm

For the last couple of days I’ve been at the eMetrics conference in San Francisco. There were several panels that discussed big data, both from an engineering standpoint as well as how to adopt newer ...

Recently parsed news:

Recent searches: