Ryokan Ri

ryo0634

https://ryou0634.github.io/

Ryou0634

AI & ML interests

Multilingual NLP, Pretrained Language Models, Information Retrieval

Papers 1

arxiv:2402.11485

Models 21

ryo0634/TinySwallow-1.5B-Math-DPO

Text Generation • 2B • Updated Sep 4 • 6

ryo0634/TinySwallow-1.5B-Math-SFT

Text Generation • 2B • Updated Sep 4 • 4

ryo0634/Swallow-7b-hf-oasst1-21k-ja-alert-dpo-100-steps-beta-2e-1

Text Generation • 7B • Updated Aug 6, 2024 • 2

ryo0634/Swallow-7b-hf-oasst1-21k-ja-alert-dpo-100-steps-beta-1e-1

Text Generation • 7B • Updated Aug 6, 2024 • 5

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-200-steps

Text Generation • 7B • Updated Aug 6, 2024 • 3

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-safety-150-steps

Text Generation • 7B • Updated Aug 6, 2024 • 3

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-100-steps

Text Generation • 7B • Updated Aug 6, 2024 • 4

ryo0634/Swallow-7b-hf-oasst1-21k-ja-aio-retriever-200-steps

Text Generation • 7B • Updated Aug 5, 2024 • 5

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja

Text Generation • 7B • Updated Aug 4, 2024 • 5

ryo0634/Swallow-7b-plus-hf-oasst1-21k-ja

Text Generation • 7B • Updated Jul 25, 2024 • 5

Datasets 22

ryo0634/gsm8k-ja-noisy-dpo-on-policy-4

Viewer • Updated Sep 4 • 890 • 22

ryo0634/gsm8k-ja-noisy-dpo-on-policy-3

Viewer • Updated Sep 4 • 900 • 39

ryo0634/gsm8k-ja-noisy-dpo-on-policy

Viewer • Updated Sep 3 • 706 • 33

ryo0634/gsm8k-ja-noisy-dpo-on-policy-2

Viewer • Updated Sep 3 • 1.07k • 30

ryo0634/gsm8k-ja-noisy-dpo

Viewer • Updated Sep 3 • 1k • 26

ryo0634/gsm8k-ja-noisy-sft

Viewer • Updated Jul 28 • 1k • 40

ryo0634/gsm8k-ja-filtered-dev

Viewer • Updated Jul 27 • 400 • 22

ryo0634/gsm8k-ja-filtered-sft

Viewer • Updated Jul 27 • 3k • 27

ryo0634/math-short-thought-filtered

Viewer • Updated May 23 • 757 • 14

ryo0634/math-thought-filtered

Viewer • Updated May 23 • 923 • 10
