feifei_look_transformers

1
license:apache-2.0
by
aifeifei798
Language Model
OTHER
New
0 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
Unknown
Mobile
Laptop
Server
Quick Summary

AI model with specialized capabilities.

Code Examples

bash
python final_report.py 
๐Ÿš€ ๅฏๅŠจ็ปˆๆžๅ†ณ็ญ–้“พๅ…จๆ™ฏๆŠฅๅ‘Š็”Ÿๆˆๅ™จ...
๐Ÿ“ ๆต‹่ฏ• Prompt: 'you are fox,give say a ...'
Loading weights: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 236/236 [00:00<00:00, 3297.84it/s, Materializing param=model.norm.weight]

================================================================================
๐Ÿ“„ ๅผ€ๅง‹ๅฏนๆจกๅž‹ [Base-IT (่€้ป„็‰›)] ่ฟ›่กŒ็ปˆๆžๅ†ณ็ญ–้“พๅฎก่ฎก
================================================================================

[้˜ถๆฎต 1 & 2] ไปŽ่พ“ๅ…ฅๅˆฐ Layer 18 Raw (้ƒจ้—จไธป็ฎก็š„ๆœ€็ปˆๆๆกˆๅฝขๆˆ่ฟ‡็จ‹)
--------------------------------------------------------------------------------
่ฟ™ๆ˜ฏๆฏไธ€ๅฑ‚่ฎก็ฎ—ๅฎŒๆฏ•ๅŽ๏ผŒๆœช็ปไปปไฝ•ไฟฎๆญฃ็š„โ€œๅŽŸๅง‹ๅฟตๅคดโ€๏ผš
  - Embed (Raw) : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [\n] (100.0%)
  - L-1 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [เธžเธฒเธฐ] (89.1%)
  - L-2 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [is] (86.7%)
  - L-3 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [setPrototypeOf] (100.0%)
  - L-4 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (100.0%)
  - L-5 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (98.0%)
  - L-6 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€ฌ] (100.0%)
  - L-7 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-8 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-9 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-10 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-11 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-12 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-13 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-14 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-15 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-16 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-17 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-18 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [I] (82.8%)
--------------------------------------------------------------------------------

[้˜ถๆฎต 3] Layer 18 Raw -> Final Norm (ๆŠ€ๆœฏๆ€ป็›‘ๅฎกๆŸฅๅนถไฟฎๆ”นๆๆกˆ)
--------------------------------------------------------------------------------
1. ้ƒจ้—จไธป็ฎก (L-18 Raw) ๆไบค็š„ๅŽŸๅง‹ๆๆกˆ็ฟป่ฏ‘ๅฆ‚ไธ‹:
    - Rank 1: [I] 	 ๆฆ‚็އ: 82.81%
    - Rank 2: [Okay] 	 ๆฆ‚็އ: 10.55%
    - Rank 3: [<end_of_turn>] 	 ๆฆ‚็އ: 2.32%
    - Rank 4: [Alright] 	 ๆฆ‚็އ: 0.55%
    - Rank 5: [Under] 	 ๆฆ‚็އ: 0.49%

2. ๆŠ€ๆœฏๆ€ป็›‘ (Final Norm) ๅฏนๆๆกˆๅ‘้‡่ฟ›่กŒไบ†ไฟฎๆญฃใ€‚
   (ๅ‘้‡ๆ–นๅ‘ๅ็งปๅบฆ: 0.7734, 1.0 ่กจ็คบๆœชไฟฎๆญฃ)
--------------------------------------------------------------------------------

[้˜ถๆฎต 4] Normalized Vector -> LM Head (็ง˜ไนฆๅค„ๅฐ†ไฟฎๆ”นๅŽ็š„ๆๆกˆ็ฟป่ฏ‘ๆˆๅ…ทไฝ“ๆ–นๆกˆ)
--------------------------------------------------------------------------------
ๆŠ€ๆœฏๆ€ป็›‘ไฟฎๆญฃๅŽ็š„ๆๆกˆ๏ผŒ็ป็ง˜ไนฆๅค„็ฟป่ฏ‘๏ผŒๅ†…ๅฎนๅ˜ไธบ:
    - Rank 1: [Warm] 	 ๆฆ‚็އ: 96.88%
    - Rank 2: [เป€เบž] 	 ๆฆ‚็އ: 1.78%
    - Rank 3: [Resource] 	 ๆฆ‚็އ: 1.08%
    - Rank 4: [ asistente] 	 ๆฆ‚็އ: 0.04%
    - Rank 5: [Flowers] 	 ๆฆ‚็އ: 0.03%
--------------------------------------------------------------------------------

[้˜ถๆฎต 5] CEO (Decoding Strategy) ็ป“ๅˆๆ‰€ๆœ‰ไฟกๆฏๅšๅ‡บๆœ€็ปˆ่ฃๅ†ณ
--------------------------------------------------------------------------------
1. CEO ๅœจๅšๅ†ณๅฎšๅ‰๏ผŒๅ‚่€ƒ็š„ๆœ€็ปˆๆฆ‚็އๅˆ†ๅธƒ (outputs.logits) ๆ˜ฏ:
    - Rank 1: [I] 	 ๆฆ‚็އ: 82.81%
    - Rank 2: [Okay] 	 ๆฆ‚็އ: 10.55%
    - Rank 3: [<end_of_turn>] 	 ๆฆ‚็އ: 2.32%
    - Rank 4: [Alright] 	 ๆฆ‚็އ: 0.55%
    - Rank 5: [Under] 	 ๆฆ‚็އ: 0.49%

2. ็ป่ฟ‡ๅฏนไธŠไธ‹ๆ–‡ใ€้ฃŽ้™ฉๅ’Œ่ฟž่ดฏๆ€ง็š„ๆœ€็ปˆๆƒ่กก๏ผŒCEO ๅ‘่กจไบ†ๅ…ฌๅผ€ๅฃฐๆ˜Ž:
The following generation flags are not valid and may be ignored: ['top_p', 'top_k']. Set `TRANSFORMERS_VERBOSITY=info` for more details.
Setting `pad_token_id` to `eos_token_id`:1 for open-end generation.
   >>> I am Gemma, an AI language model. I can generate text in various formats, including poems, stories, code, and more. I'm here to help you with whatever you need! Tell me what you want.
--------------------------------------------------------------------------------
โœ… ๆจกๅž‹ [Base-IT (่€้ป„็‰›)] ๅ†ณ็ญ–้“พๅฎก่ฎกๅฎŒๆˆใ€‚
Loading weights: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 236/236 [00:00<00:00, 3059.68it/s, Materializing param=model.norm.weight]

================================================================================
๐Ÿ“„ ๅผ€ๅง‹ๅฏนๆจกๅž‹ [FT (็›‘ๅทฅไป‹ๅ…ฅ)] ่ฟ›่กŒ็ปˆๆžๅ†ณ็ญ–้“พๅฎก่ฎก
================================================================================

[้˜ถๆฎต 1 & 2] ไปŽ่พ“ๅ…ฅๅˆฐ Layer 18 Raw (้ƒจ้—จไธป็ฎก็š„ๆœ€็ปˆๆๆกˆๅฝขๆˆ่ฟ‡็จ‹)
--------------------------------------------------------------------------------
่ฟ™ๆ˜ฏๆฏไธ€ๅฑ‚่ฎก็ฎ—ๅฎŒๆฏ•ๅŽ๏ผŒๆœช็ปไปปไฝ•ไฟฎๆญฃ็š„โ€œๅŽŸๅง‹ๅฟตๅคดโ€๏ผš
  - Embed (Raw) : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [\n] (100.0%)
  - L-1 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [เธžเธฒเธฐ] (86.7%)
  - L-2 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [is] (91.0%)
  - L-3 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [setPrototypeOf] (100.0%)
  - L-4 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (100.0%)
  - L-5 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (97.7%)
  - L-6 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€ฌ] (100.0%)
  - L-7 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-8 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-9 (RAW)   : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-10 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-11 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-12 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-13 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-14 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-15 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-16 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-17 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [โ€Œ] (100.0%)
  - L-18 (RAW)  : ๆœ€ๅฏ่ƒฝ็š„่ฏๆ˜ฏ [I] (68.4%)
--------------------------------------------------------------------------------

[้˜ถๆฎต 3] Layer 18 Raw -> Final Norm (ๆŠ€ๆœฏๆ€ป็›‘ๅฎกๆŸฅๅนถไฟฎๆ”นๆๆกˆ)
--------------------------------------------------------------------------------
1. ้ƒจ้—จไธป็ฎก (L-18 Raw) ๆไบค็š„ๅŽŸๅง‹ๆๆกˆ็ฟป่ฏ‘ๅฆ‚ไธ‹:
    - Rank 1: [I] 	 ๆฆ‚็އ: 68.36%
    - Rank 2: [Okay] 	 ๆฆ‚็އ: 14.16%
    - Rank 3: [<end_of_turn>] 	 ๆฆ‚็އ: 8.45%
    - Rank 4: [Alright] 	 ๆฆ‚็އ: 1.31%
    - Rank 5: [ะž] 	 ๆฆ‚็އ: 0.66%

2. ๆŠ€ๆœฏๆ€ป็›‘ (Final Norm) ๅฏนๆๆกˆๅ‘้‡่ฟ›่กŒไบ†ไฟฎๆญฃใ€‚
   (ๅ‘้‡ๆ–นๅ‘ๅ็งปๅบฆ: 0.7891, 1.0 ่กจ็คบๆœชไฟฎๆญฃ)
--------------------------------------------------------------------------------

[้˜ถๆฎต 4] Normalized Vector -> LM Head (็ง˜ไนฆๅค„ๅฐ†ไฟฎๆ”นๅŽ็š„ๆๆกˆ็ฟป่ฏ‘ๆˆๅ…ทไฝ“ๆ–นๆกˆ)
--------------------------------------------------------------------------------
ๆŠ€ๆœฏๆ€ป็›‘ไฟฎๆญฃๅŽ็š„ๆๆกˆ๏ผŒ็ป็ง˜ไนฆๅค„็ฟป่ฏ‘๏ผŒๅ†…ๅฎนๅ˜ไธบ:
    - Rank 1: [Coffee] 	 ๆฆ‚็އ: 80.08%
    - Rank 2: [Resource] 	 ๆฆ‚็އ: 10.84%
    - Rank 3: [Assistant] 	 ๆฆ‚็އ: 8.45%
    - Rank 4: [ asistente] 	 ๆฆ‚็އ: 0.25%
    - Rank 5: [Waiting] 	 ๆฆ‚็އ: 0.20%
--------------------------------------------------------------------------------

[้˜ถๆฎต 5] CEO (Decoding Strategy) ็ป“ๅˆๆ‰€ๆœ‰ไฟกๆฏๅšๅ‡บๆœ€็ปˆ่ฃๅ†ณ
--------------------------------------------------------------------------------
1. CEO ๅœจๅšๅ†ณๅฎšๅ‰๏ผŒๅ‚่€ƒ็š„ๆœ€็ปˆๆฆ‚็އๅˆ†ๅธƒ (outputs.logits) ๆ˜ฏ:
    - Rank 1: [I] 	 ๆฆ‚็އ: 68.36%
    - Rank 2: [Okay] 	 ๆฆ‚็އ: 14.16%
    - Rank 3: [<end_of_turn>] 	 ๆฆ‚็އ: 8.45%
    - Rank 4: [Alright] 	 ๆฆ‚็އ: 1.31%
    - Rank 5: [ะž] 	 ๆฆ‚็އ: 0.66%

2. ็ป่ฟ‡ๅฏนไธŠไธ‹ๆ–‡ใ€้ฃŽ้™ฉๅ’Œ่ฟž่ดฏๆ€ง็š„ๆœ€็ปˆๆƒ่กก๏ผŒCEO ๅ‘่กจไบ†ๅ…ฌๅผ€ๅฃฐๆ˜Ž:
Setting `pad_token_id` to `eos_token_id`:1 for open-end generation.
   >>> I am Gemma, an AI language model. I can generate text and answer your questions in a variety of ways. I'm here to help you with whatever you need! Tell me what you want.
--------------------------------------------------------------------------------
โœ… ๆจกๅž‹ [FT (็›‘ๅทฅไป‹ๅ…ฅ)] ๅ†ณ็ญ–้“พๅฎก่ฎกๅฎŒๆˆใ€‚


================================================================================
๐ŸŽ‰ ๆ‰€ๆœ‰ๅฎก่ฎกๅทฅไฝœๅทฒๅฎŒๆˆใ€‚
================================================================================

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.