Gen-Verse / ReasonFlux Star 351 Code Issues Pull requests ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates reinforcement-learning chain-of-thought llm-rlhf sft-data o1-mini o1-preview deepseek-v3 deepseek-r1 Updated Mar 22, 2025 Python
yiyepiaoling0715 / codellm-data-preprocess-pipeline Star 34 Code Issues Pull requests 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota topological-sort fim function-dependency pretrain-data sft-data codellm-completion Updated Jul 25, 2024 Python
Evil-cyber65 / ReasonFlux Star 0 Code Issues Pull requests ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates reinforcement-learning chain-of-thought llm-rlhf sft-data o1-mini o1-preview deepseek-v3 deepseek-r1 Updated Mar 23, 2025