a simple bash script to run a reverse turing test on a LLM
Reference: https://www.youtube.com/watch?v=MxTWLm9vT_o
| Tested models: | score | acc |
|---|---|---|
| Llama 3.1 8B instruct Q6_K | 4/4 | 1.0 |
| phi 3.5 3.8B 8k instruct Q6_K | 4/4 | 1.0 |
| phi 3 medium 128k instruct 14B IQ4_XS | 4/4 | 1.0 |
| phi 3 mini 128k instruct 3.8B Q6_K | 4/4 | 1.0 |
| solar 10.7B instruct Q6_K | 4/4 | 1.0 |
| solar pro preview instruct 22B IQ4_XS | 4/4 | 1.0 |
| Qwen 2.5 instruct 32B IQ4_XS | 4/4 | 1.0 |
| gemma 2 2B it Q8_0 | 3/4 | 0.75 |
| glm4 9B chat IQ4_XS | 3/4 | 0.75 |
| mistral 7B instruct v0.3 Q6_K | 3/4 | 0.75 |
| EXAONE 3.5 2.4B instruct IQ4_XS | 2/4 | 0.5 |
| EXAONE 3.5 32B instruct IQ4_XS | 2/4 | 0.5 |
| internlm 7B chat Q6_K | 2/4 | 0.5 |
| Llama 3.2 3B instruct Q6_K | 2/4 | 0.5 |
| Ministral 8B Instruct 2410 Q6_K | 2/4 | 0.5 |
| Mistral Small 22B Instruct 2409 Q6_K | 2/4 | 0.5 |
| openchat 7B 0106 Q8_0 | 2/4 | 0.5 |
| phi 3 medium 4k instruct 14B IQ4_XS | 2/4 | 0.5 |
| phi 3 mini 4k instruct 3.8B Q8_0 | 2/4 | 0.5 |
| EXAONE 3.5 7B instruct IQ4_XS | 1/4 | 0.25 |
| Falcon 3 7B instruct IQ4_XS | 1/4 | 0.25 |
| Falcon 3 10B instruct IQ4_XS | 1/4 | 0.25 |
| gemma 2 9B it Q6_K | 1/4 | 0.25 |
| gemma 2 27B it IQ4_XS | 1/4 | 0.25 |
| granite 3.0 8B instruct Q6_K | 1/4 | 0.25 |
| internlm3 8B chat IQ4_XS | 1/4 | 0.25 |
| Mistral Nemo 12B Q4_K_M | 1/4 | 0.25 |
| openchat 3.6 8B Q8_0 | 1/4 | 0.25 |
| phi 3.5 3.8B 128k Q6_K | 1/4 | 0.25 |
| qwen 2 7B Q6_K | 1/4 | 0.25 |
| qwen 2.5 instruct 7B Q6_K | 1/4 | 0.25 |
| Qwen 2.5 instruct 14B IQ4_XS | 1/4 | 0.25 |
| Falcon 3 1B instruct IQ4_XS | 0/4 | 0.0 |
| granite 3.0 2B instruct Q6_K | 0/4 | 0.0 |