A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Fast SHAP value computation for interpreting tree-based models
Apache Hive
Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop
Last updated: 3 days ago
AdFullSsl is a tool that automatically detects ads that are not SSL-compliant and fixes them.
Last updated: 3 days ago
API Hub is a web UI for browsing and searching a catalog of Rest.li APIs.
Last updated: 3 days ago
Sepia is a VCR-like module for Node.js that records HTTP interactions, then plays them back exactly as they occurred the first time they were invoked.
Last updated: 3 days ago
Lafayette is a system for storing email abuse reports sent in ARF (Abuse Reporting Format).
Last updated: 3 days ago
Archetype is a Compass/Sass-based framework for authoring configurable, composable UI components and patterns.
Last updated: 3 days ago
Hadoop library for large-scale data processing, now an Apache Incubator project
Last updated: 3 days ago
A flexible, partial, out-of-order and real-time typeahead search library
Last updated: 3 days ago
Central hub for distributing web apps to multiple browsers on multiple environments
Last updated: 3 days ago
Norbert is a cluster manager and networking layer built on top of ZooKeeper.
Last updated: 3 days ago