gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization a...

Last updated: 5 days ago

gobblin-hive

Apache Hive

Last updated: 5 days ago

FastTreeSHAP

Fast SHAP value computation for interpreting tree-based models

Last updated: 5 days ago

dynoyarn

DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.

Last updated: 5 days ago

feathr

Feathr – A scalable, unified data and AI engineering platform for enterprise

Last updated: 5 days ago

beam

Apache Beam is a unified programming model for Batch and Streaming

Last updated: 5 days ago

data-integration-library

The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.

Last updated: 5 days ago

tracked-queue

An autotracked implementation of a ring-buffer-backed double-ended queue

Last updated: 5 days ago

lambda-learner

Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.

Last updated: 5 days ago

datahub-gma

General Metadata Architecture

Last updated: 5 days ago

linkedin-orc

LinkedIn's version of Apache ORC

Last updated: 5 days ago

zookeeper

Mirror of Apache Hadoop ZooKeeper

Last updated: 5 days ago

linkedin-calcite

LinkedIn's version of Apache Calcite

Last updated: 5 days ago

spark-tfrecord

Read and write TensorFlow TFRecord data from Apache Spark.

Last updated: 5 days ago

detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Last updated: 5 days ago

agent-loader

EA Agent Loader is a collection of utilities for Java agent developers.

Last updated: 5 days ago

kafka

Mirror of Apache Kafka

Last updated: 5 days ago

iris-mobile

A mobile interface for linkedin/iris, built for iOS and Android on the Ionic platform

Last updated: 5 days ago

Tachyon

An Android library that provides a customizable calendar day view UI widget.

Last updated: 5 days ago

apache-incubator-gobblin

Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin f...

Last updated: 5 days ago