A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.
Fast SHAP value computation for interpreting tree-based models
Apache Hive
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization a...
Last updated: 5 days ago
DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.
Last updated: 5 days ago
The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
Last updated: 5 days ago
Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.
Last updated: 5 days ago
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
Last updated: 5 days ago
A mobile interface for linkedin/iris, built for iOS and Android on the Ionic platform
Last updated: 5 days ago
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin f...
Last updated: 5 days ago