Upload README.md with huggingface_hub
README.md
CHANGED
@@ -14,22 +14,22 @@ tags:
 
 | Parameter | Value |
 | :-------- | :---: |
-| **direction_index** |
-| **attn.o_proj.max_weight** | 1.
-| **attn.o_proj.max_weight_position** |
-| **attn.o_proj.min_weight** |
-| **attn.o_proj.min_weight_distance** |
-| **mlp.down_proj.max_weight** | 1.
-| **mlp.down_proj.max_weight_position** |
+| **direction_index** | 26.77 |
+| **attn.o_proj.max_weight** | 1.41 |
+| **attn.o_proj.max_weight_position** | 43.65 |
+| **attn.o_proj.min_weight** | 1.27 |
+| **attn.o_proj.min_weight_distance** | 37.41 |
+| **mlp.down_proj.max_weight** | 1.23 |
+| **mlp.down_proj.max_weight_position** | 45.02 |
 | **mlp.down_proj.min_weight** | 0.92 |
-| **mlp.down_proj.min_weight_distance** |
+| **mlp.down_proj.min_weight_distance** | 33.73 |
 
 ## Performance
 
 | Metric | This model | Original model ([Qwen/Qwen3-VL-32B-Instruct](https://huggingface.co/Qwen/Qwen3-VL-32B-Instruct)) |
 | :----- | :--------: | :---------------------------: |
-| **KL divergence** | 0.
-| **Refusals** |
+| **KL divergence** | 0.1565 | 0 *(by definition)* |
+| **Refusals** | 7/100 | 99/100 |
 
 -----
 
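The Performance table lists the original model's KL divergence as 0 "by definition". A minimal sketch of why: the KL divergence of any distribution from itself is zero, and it becomes positive as soon as the two distributions differ. The README does not state how the reported 0.1565 was measured (which prompts, which token positions, which direction of the divergence), so the function `kl_divergence` and the two toy distributions below are illustrative assumptions, not the author's evaluation code.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """Return KL(p || q) in nats for two discrete distributions on the same support."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0.0:  # terms with p_i == 0 contribute nothing
            total += pi * math.log(pi / max(qi, eps))
    return total


# Hypothetical next-token distributions over a 4-token vocabulary
# (made-up numbers, not taken from either model).
original = [0.70, 0.20, 0.05, 0.05]
modified = [0.55, 0.30, 0.10, 0.05]

print(kl_divergence(original, original))  # 0.0: a model compared with itself
print(kl_divergence(original, modified))  # positive: the distributions differ
```

Read this way, a lower KL divergence suggests the modified model's output distribution stays closer to the original's, while the Refusals row tracks the behavior that was changed.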