Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Contributor Guideline & Design Doc Collections[Current Status Quo & Future Plan] #11

Open
ashione opened this issue Jan 20, 2022 · 0 comments
Assignees
Labels
good first issue Good for newcomers

Comments

@ashione
Copy link
Member

ashione commented Jan 20, 2022

Main Modules


Reference & Design Doc

Modules of Ray Streaming:

  • CrossLang API
  • Runtime
  • Coordinator/Scheduler
  • Reliability/Fault Tolerance
  • Fusion Training
  • Advanced Functions

✅ : Finished
⚠️ : Move From Internal/Developing
🏃🏻‍♀️: Welcome to contribute(If you want to contribute features please let us know)

CrossLang API

X-Lang:

  • Java <-> Python ✅
  • Arrow ⚠️

API:

  • Source ✅
  • Union ✅
  • Sink ✅
  • Map ✅
  • FlatMap ✅
  • Reduce ✅
  • Join ⚠️
  • Window ⚠️

Popular Connectors:

  • Kafka 🏃🏻‍♀️
  • Mysql 🏃🏻‍♀️
  • etc.

Runtime

Transfer:

  • Low Latency RingBuffer Auto Flush ✅
  • RandomShuffle ✅
  • HashShuffle ✅
  • DynamicRebalance ⚠️
  • Colocated SharedMemory ⚠️
  • FlowControl ⚠️ (Without Empty Message): 04 Feb Empty Message

State Backend:

  • State backend common library ⚠️
  • Memory Backend ✅
  • Rocksdb Backend ⚠️
  • S3 Backend 🏃🏻‍♀️

Buffer Optimization:

  • BufferPool ⚠️ Part1 28 Mar
  • ElasticBuffer ⚠️

Coordinator/Scheduler

Coordinator:

  • Rewrite JobMaster in Python 🏃🏻‍♀️

Scheduler & AutoScale:

  • PlacementGroup-Pipeline First ⚠️
  • Pod-Wise-Random-Scheduler ✅
  • Random Scheduler ⚠️
  • United Distributed Controller ⚠️🏃🏻‍♀️
  • Rescale Adaption ⚠️🏃🏻‍♀️

Reliability/Fault Tolerance

Reliability:

  • Checkpoint Sync ✅
  • Checkpoint Async ⚠️🏃🏻‍♀️
  • At Least Once ✅
  • Exactly Once ⚠️
  • Exactly Same ⚠️

Fusion Training

Training:

  • CircularBuffer shared with tensorflow/pytorch reader ⚠️
  • Parameter Server Scheduler ⚠️
  • Evaluator Scheduler ⚠️
  • Parameter Server AutoPartition ⚠️

Advanced Functions

  • Distributed RPC over DAG ⚠️
  • Metrics & Profiling ⚠️
@ashione ashione pinned this issue Jan 20, 2022
@ashione ashione added the good first issue Good for newcomers label Jan 20, 2022
@ashione ashione changed the title Contributor Guideline[Current Statation & Future Plan] Contributor Guideline[Current Status Quo & Future Plan] Jan 20, 2022
@ashione ashione self-assigned this Feb 21, 2022
@ashione ashione changed the title Contributor Guideline[Current Status Quo & Future Plan] Contributor Guideline & Design Doc Collections[Current Status Quo & Future Plan] Feb 21, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant