SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

👀 TL;DR

SwiReasoning is a training-free method for Pareto-superior reasoning LLMs that dynamically switches between explicit and latent thinking, with a switch count control mechanism to suppress overthinking.
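To make the idea of switch count control concrete, here is a toy sketch; it is not the repository's implementation. The function `trace_modes`, the boolean `signals` that stand in for whatever triggers a switch, the starting mode, and the rule that an exhausted budget ends thinking are all assumptions made for illustration.

```python
from typing import Iterable, List, Optional


def trace_modes(signals: Iterable[bool],
                max_switch_count: Optional[int] = 2,
                start_mode: str = "explicit") -> List[str]:
    """Return the thinking mode after each step of a toy switching loop.

    signals: hypothetical per-step flags meaning "a switch is requested now".
    max_switch_count: cap on the number of switches; None disables the cap.
    """
    mode = start_mode          # assumption: which mode comes first
    switches = 0
    trace = []
    for want_switch in signals:
        if mode != "answer" and want_switch:
            if max_switch_count is not None and switches >= max_switch_count:
                # Assumption: once the budget is spent, stop thinking and answer.
                mode = "answer"
            else:
                mode = "latent" if mode == "explicit" else "explicit"
                switches += 1
        trace.append(mode)
    return trace


if __name__ == "__main__":
    print(trace_modes([False, True, False, True, True, False], max_switch_count=2))
```

In this sketch a smaller cap ends thinking sooner, which is one way to picture how limiting switches could curb overthinking.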

demo.mp4

Comparison of solving the same question with the same reasoning LLM (6s vs. 1min).

⚙️ Getting Started

Clone the project

git clone https://github.com/sdc17/SwiReasoning.git
cd SwiReasoning

Environment setup

conda create -n swir python=3.12
conda activate swir
pip install -r requirements.txt

💻 Interactive Chat

python run_chat.py --model_name Qwen/Qwen3-8B --method swir --max_switch_count 2

Modify --model_name to try different reasoning LLMs.
Increase --max_switch_count to allow more thinking rounds (default: 2).

Commands:
  exit or q -> [Exit]
  switch <N|none> -> [Set] swir max_switch_count = N (integer >= 1) or None (disabled)
  method <swir|cot|cot_greedy> -> [Set] generation method

Please check run_chat.sh for more examples.
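The commands listed above could be parsed roughly as follows. This is a minimal sketch and not the code in run_chat.py: `ChatState` and `handle_command` are hypothetical names, and treating any unrecognized line as an ordinary prompt is an assumption.

```python
from dataclasses import dataclass
from typing import Optional

METHODS = ("swir", "cot", "cot_greedy")


@dataclass
class ChatState:
    method: str = "swir"
    max_switch_count: Optional[int] = 2  # None means switch count control is disabled
    running: bool = True


def handle_command(line: str, state: ChatState) -> bool:
    """Apply a chat command to `state`.

    Returns True if `line` was a command, False if it should be treated
    as an ordinary prompt for the model.
    """
    parts = line.strip().split()
    if not parts:
        return False
    head = parts[0].lower()
    if head in ("exit", "q") and len(parts) == 1:
        state.running = False
        return True
    if head == "switch" and len(parts) == 2:
        arg = parts[1].lower()
        if arg == "none":
            state.max_switch_count = None
            return True
        if arg.isdigit() and int(arg) >= 1:
            state.max_switch_count = int(arg)
            return True
        raise ValueError("switch expects an integer >= 1 or 'none'")
    if head == "method" and len(parts) == 2:
        name = parts[1].lower()
        if name not in METHODS:
            raise ValueError("method expects one of: " + ", ".join(METHODS))
        state.method = name
        return True
    return False
```

For example, `handle_command("switch none", state)` disables the cap, while a line such as `What is 2+2?` returns False and would be passed to the model.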

📈 Evaluation

# Evaluate without switch count control
torchrun --nproc_per_node 1 --nnodes 1 --node_rank 0 --master_port $((RANDOM + 20000)) run.py --model_name Qwen/Qwen3-1.7B \
    --dataset_name gsm8k --batch_size 512 --max_new_tokens 32768 --method swir --alpha 0.6
python merge.py --model_name Qwen/Qwen3-1.7B --dataset_name gsm8k --max_new_tokens 32768 --method swir

# Evaluate with switch count control
torchrun --nproc_per_node 1 --nnodes 1 --node_rank 0 --master_port $((RANDOM + 20000)) run.py --model_name Qwen/Qwen3-8B \
    --dataset_name gsm8k --batch_size 256 --max_new_tokens 32768 --method swir --alpha 0.5 --max_switch_count 2
python merge.py --model_name Qwen/Qwen3-8B --dataset_name gsm8k --max_new_tokens 32768 --method swir

Increase --nproc_per_node to enable faster evaluation on multiple GPUs.
Modify --model_name and --dataset_name for evaluation with different models and datasets.
Please check run.sh for more examples.
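When sweeping over models, datasets, or GPU counts, it can help to generate the two commands above from a script. The helper below is a sketch that only reassembles the flags shown in this section; `build_eval_commands` is a hypothetical name and is not part of the repository.

```python
import random
import shlex
from typing import Optional, Tuple


def build_eval_commands(model_name: str,
                        dataset_name: str,
                        *,
                        nproc_per_node: int = 1,
                        batch_size: int = 256,
                        max_new_tokens: int = 32768,
                        alpha: float = 0.5,
                        max_switch_count: Optional[int] = None) -> Tuple[str, str]:
    """Return (run_command, merge_command) as shell-ready strings."""
    # Mirrors $((RANDOM + 20000)); bash RANDOM is in the range 0..32767.
    port = random.randint(20000, 20000 + 32767)
    run = [
        "torchrun", "--nproc_per_node", str(nproc_per_node),
        "--nnodes", "1", "--node_rank", "0", "--master_port", str(port),
        "run.py", "--model_name", model_name,
        "--dataset_name", dataset_name, "--batch_size", str(batch_size),
        "--max_new_tokens", str(max_new_tokens),
        "--method", "swir", "--alpha", str(alpha),
    ]
    if max_switch_count is not None:
        # Only pass the flag when switch count control is wanted.
        run += ["--max_switch_count", str(max_switch_count)]
    merge = [
        "python", "merge.py", "--model_name", model_name,
        "--dataset_name", dataset_name,
        "--max_new_tokens", str(max_new_tokens), "--method", "swir",
    ]
    return shlex.join(run), shlex.join(merge)


if __name__ == "__main__":
    for cmd in build_eval_commands("Qwen/Qwen3-8B", "gsm8k",
                                   nproc_per_node=4, max_switch_count=2):
        print(cmd)
```

The printed strings can be pasted into a shell or passed to a job scheduler; the values for `batch_size` and `alpha` should be taken from run.sh rather than from the defaults assumed here.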

💬 Acknowledgments

We thank the contributors of open-source projects Transformers, Qwen3, and Soft-Thinking.

✨ BibTeX

@misc{shi2025swireasoningswitchthinkinglatentexplicit,
      title={SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs}, 
      author={Dachuan Shi and Abedelkadir Asi and Keying Li and Xiangchi Yuan and Leyan Pan and Wenke Lee and Wen Xiao},
      year={2025},
      eprint={2510.05069},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2510.05069}, 
}
