Big Data Analytics Infrastructure

Rmux is a Redis connection pooler and multiplexer, written in Go. Rmux is meant to be used for LAMP stacks, or other short-lived process applications, with high request volume. It should be run as a client, on every server that connects to redis - to reduce the total inbound connection count to the redis servers, while handle consistent multiplexing.
Access 264 data sets at http://archive.ics.uci.edu/ml/datasets.html.
The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms.
Datasets from UCI also available at http://www.sgi.com/tech/mlc/db/.
Identifying some of the most influential algorithms that are widely used in the data mining community, The Top Ten Algorithms in Data Mining provides a description of each algorithm, discusses its impact, and reviews current and future research.
In a new HuffPost/YouGov poll, only 36 percent of Americans reported having "a lot" of trust that information they get from scientists is accurate and reliable. Fifty-one percent said they trust that information only a little, and another 6 percent said they don't trust it at all. See: http://huff.to/19Joyn5
A new book "Predictive Business Analytics" by Gary Cokins, a Data Science Assocation Advisory Board Member. Gary is a great writer who is an expert with talent for simplifying complex subjects with clarity. Gary is a trusted advisor and I strongly recommend that you purchase this book.