TensorFlow Machine Learning System
Docker Container to Provide Apache Tika RESTful API
Machine Learning Workflow
CSVLint: Check and Improve CSV Files
CSVLint helps you to check that your CSV file is readable. And you can use it to check whether it contains the columns and types of values that it should.
Musical Taste Correlates with SAT Scores?
Does human intelligence have any connection to the type of music a person listens to? Can we define human intelligence? How do you measure human intelligence? Do SAT scores accurately measure human intelligence? Is there any evidence that SAT scores accurately predict educational or workplace performance?
I am skeptical 1) that we can measure human intelligence at this time; and 2) that SAT scores are an accurate measurement of anything save a very narrow form of test-taking ability that adds little if any value in the real world.
Feature Forge: Tools for Creating and Testing Machine Learning Features
The Feature Forge library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).
Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:
Data Scientists Must Focus on Data Security Risk
GearPump: Actor Driven Streaming Framework
GearPump is a lightweight real-time big data streaming engine. Inspired by recent advances in the Akka framework and a desire to improve on existing streaming frameworks. Inspired by MillWheel, Storm, spark streaming, and SAMZA. Benchmarks shows process 2 million messages/second (100 bytes per message) with latency around 30ms on a cluster of 4 nodes.