Support intern-s1 #14875

RunningLeon · 2025-07-25T11:33:20Z

Support internlm/Intern-S1

convert_hf_to_gguf.py

CISC · 2025-07-29T09:22:11Z

The Python Type-Check CI needs to be resolved.

RunningLeon · 2025-07-30T06:16:05Z

The Python Type-Check CI needs to be resolved.

@CISC hi, could you tell how to fix this error? Seems not reasonable to me

/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3219:23 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3220:13 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3220:49 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:32[21](https://github.com/ggml-org/llama.cpp/actions/runs/16612224904/job/46997396567?pr=14875#step:5:22):23 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3[22](https://github.com/ggml-org/llama.cpp/actions/runs/16612224904/job/46997396567?pr=14875#step:5:23)2:13 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
/home/runner/work/llama.cpp/llama.cpp/convert_hf_to_gguf.py:3222:49 - error: Object of type "None" is not subscriptable (reportOptionalSubscript)
Error: Object of type "None" is not subscriptable (reportOptionalSubscript)
6 errors, 0 warnings, 14 informations
Error: 6 errors

CISC · 2025-07-30T06:40:02Z

The Python Type-Check CI needs to be resolved.

@CISC hi, could you tell how to fix this error? Seems not reasonable to me

Running pyright locally helps, the line numbers are wrong for some reason, this is the actual erroneous codeblock:

llama.cpp/convert_hf_to_gguf.py

Lines 3002 to 3005 in 5eba3e3

    
           if isinstance(self.hparams_vision['image_size'], list): 
        
               self.hparams_vision['image_size'] = self.hparams_vision['image_size'][0] 
        
           if isinstance(self.hparams_vision['patch_size'], list): 
        
               self.hparams_vision['patch_size'] = self.hparams_vision['patch_size'][0]

convert_hf_to_gguf.py

gguf-py/gguf/tensor_mapping.py

convert_hf_to_gguf.py

CISC · 2025-08-07T13:15:40Z

convert_hf_to_gguf.py

+        try:
+            self._set_vocab_sentencepiece()
+        except FileNotFoundError:
+            self._set_vocab_gpt2()


Suggested change

try:

self._set_vocab_sentencepiece()

except FileNotFoundError:

self._set_vocab_gpt2()

super().set_vocab()

CISC · 2025-08-07T13:16:42Z

convert_hf_to_gguf.py

+        special_tokens_map_file = self.dir_model / 'special_tokens_map.json'
+        additional_special_tokens = []
+        if special_tokens_map_file.is_file():
+            with open(special_tokens_map_file, encoding = 'utf-8') as f:
+                additional_special_tokens = json.load(f).get('additional_special_tokens', [])
+        tokenizer_cfg_file = self.dir_model / 'special_tokens_map.json'
+        if tokenizer_cfg_file.is_file():
+            with open(tokenizer_cfg_file, encoding = 'utf-8') as f:
+                added_tokens_decoder = json.load(f).get('added_tokens_decoder', {})
+                token2ids_map = {data['content'] : int(token) for token, data in added_tokens_decoder.items() if data['special']}
+                for token in additional_special_tokens:
+                    if token in token2ids_map:
+                        special_vocab._set_special_token(token, token2ids_map[token])
+        special_vocab._set_special_token('eos', 151645)


Suggested change

special_tokens_map_file = self.dir_model / 'special_tokens_map.json'

additional_special_tokens = []

if special_tokens_map_file.is_file():

with open(special_tokens_map_file, encoding = 'utf-8') as f:

additional_special_tokens = json.load(f).get('additional_special_tokens', [])

tokenizer_cfg_file = self.dir_model / 'special_tokens_map.json'

if tokenizer_cfg_file.is_file():

with open(tokenizer_cfg_file, encoding = 'utf-8') as f:

added_tokens_decoder = json.load(f).get('added_tokens_decoder', {})

token2ids_map = {data['content'] : int(token) for token, data in added_tokens_decoder.items() if data['special']}

for token in additional_special_tokens:

if token in token2ids_map:

special_vocab._set_special_token(token, token2ids_map[token])

special_vocab._set_special_token('eos', 151645)

CISC · 2025-08-07T13:18:30Z

Can merge after changes and @ngxson approves.

CISC · 2025-08-07T18:38:27Z

Hmmm, after changes... :)

ngxson · 2025-08-07T19:43:38Z

woops sorry I missed that part. I guest @RunningLeon you need to open a new PR then

* support internvl * support interns1 * resolve comments * put interns1 in tensor mapping * resolve comment * move tokenizer changes to sub class

RunningLeon added 2 commits July 16, 2025 20:55

support internvl

7cf5c4c

support interns1

859796e

github-actions bot added the python python script changes label Jul 25, 2025

CISC reviewed Jul 25, 2025

View reviewed changes

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

ngxson requested changes Jul 25, 2025

View reviewed changes

convert_hf_to_gguf.py Show resolved Hide resolved

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

resolve comments

483ffef

put interns1 in tensor mapping

5eba3e3

CISC reviewed Jul 30, 2025

View reviewed changes

convert_hf_to_gguf.py Show resolved Hide resolved

ngxson reviewed Jul 30, 2025

View reviewed changes

convert_hf_to_gguf.py Show resolved Hide resolved

ngxson reviewed Jul 30, 2025

View reviewed changes

gguf-py/gguf/tensor_mapping.py Outdated Show resolved Hide resolved

resolve comment

c71543c

CISC reviewed Jul 31, 2025

View reviewed changes

convert_hf_to_gguf.py Outdated Show resolved Hide resolved

move tokenizer changes to sub class

490a13f

CISC approved these changes Aug 7, 2025

View reviewed changes

ngxson approved these changes Aug 7, 2025

View reviewed changes

ngxson merged commit 99acbc9 into ggml-org:master Aug 7, 2025
6 checks passed

the-phobos pushed a commit to the-phobos/llama.cpp that referenced this pull request Aug 7, 2025

llama : Support intern-s1 (ggml-org#14875)

1baf2b3

* support internvl * support interns1 * resolve comments * put interns1 in tensor mapping * resolve comment * move tokenizer changes to sub class

the-phobos pushed a commit to the-phobos/llama.cpp that referenced this pull request Aug 7, 2025

llama : Support intern-s1 (ggml-org#14875)

d34be62

* support internvl * support interns1 * resolve comments * put interns1 in tensor mapping * resolve comment * move tokenizer changes to sub class

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support intern-s1 #14875

Support intern-s1 #14875

RunningLeon commented Jul 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC commented Jul 29, 2025

Uh oh!

RunningLeon commented Jul 30, 2025

Uh oh!

CISC commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC Aug 7, 2025

Uh oh!

CISC Aug 7, 2025

Uh oh!

CISC commented Aug 7, 2025

Uh oh!

Uh oh!

CISC commented Aug 7, 2025

Uh oh!

ngxson commented Aug 7, 2025

Uh oh!

Uh oh!

Support intern-s1 #14875

Support intern-s1 #14875

Conversation

RunningLeon commented Jul 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC commented Jul 29, 2025

Uh oh!

RunningLeon commented Jul 30, 2025

Uh oh!

CISC commented Jul 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CISC Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

CISC Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

CISC commented Aug 7, 2025

Uh oh!

Uh oh!

CISC commented Aug 7, 2025

Uh oh!

ngxson commented Aug 7, 2025

Uh oh!

Uh oh!