feifei_look_transformers
1
license:apache-2.0
by
aifeifei798
Language Model
OTHER
New
0 downloads
Early-stage
Edge AI:
Mobile
Laptop
Server
Unknown
Mobile
Laptop
Server
Quick Summary
AI model with specialized capabilities.
Code Examples
bash
python final_report.py
๐ ๅฏๅจ็ปๆๅณ็ญ้พๅ
จๆฏๆฅๅ็ๆๅจ...
๐ ๆต่ฏ Prompt: 'you are fox,give say a ...'
Loading weights: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 236/236 [00:00<00:00, 3297.84it/s, Materializing param=model.norm.weight]
================================================================================
๐ ๅผๅงๅฏนๆจกๅ [Base-IT (่้ป็)] ่ฟ่ก็ปๆๅณ็ญ้พๅฎก่ฎก
================================================================================
[้ถๆฎต 1 & 2] ไป่พๅ
ฅๅฐ Layer 18 Raw (้จ้จไธป็ฎก็ๆ็ปๆๆกๅฝขๆ่ฟ็จ)
--------------------------------------------------------------------------------
่ฟๆฏๆฏไธๅฑ่ฎก็ฎๅฎๆฏๅ๏ผๆช็ปไปปไฝไฟฎๆญฃ็โๅๅงๅฟตๅคดโ๏ผ
- Embed (Raw) : ๆๅฏ่ฝ็่ฏๆฏ [\n] (100.0%)
- L-1 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [เธเธฒเธฐ] (89.1%)
- L-2 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [is] (86.7%)
- L-3 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [setPrototypeOf] (100.0%)
- L-4 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (100.0%)
- L-5 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (98.0%)
- L-6 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โฌ] (100.0%)
- L-7 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-8 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-9 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-10 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-11 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-12 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-13 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-14 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-15 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-16 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-17 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-18 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [I] (82.8%)
--------------------------------------------------------------------------------
[้ถๆฎต 3] Layer 18 Raw -> Final Norm (ๆๆฏๆป็ๅฎกๆฅๅนถไฟฎๆนๆๆก)
--------------------------------------------------------------------------------
1. ้จ้จไธป็ฎก (L-18 Raw) ๆไบค็ๅๅงๆๆก็ฟป่ฏๅฆไธ:
- Rank 1: [I] ๆฆ็: 82.81%
- Rank 2: [Okay] ๆฆ็: 10.55%
- Rank 3: [<end_of_turn>] ๆฆ็: 2.32%
- Rank 4: [Alright] ๆฆ็: 0.55%
- Rank 5: [Under] ๆฆ็: 0.49%
2. ๆๆฏๆป็ (Final Norm) ๅฏนๆๆกๅ้่ฟ่กไบไฟฎๆญฃใ
(ๅ้ๆนๅๅ็งปๅบฆ: 0.7734, 1.0 ่กจ็คบๆชไฟฎๆญฃ)
--------------------------------------------------------------------------------
[้ถๆฎต 4] Normalized Vector -> LM Head (็งไนฆๅคๅฐไฟฎๆนๅ็ๆๆก็ฟป่ฏๆๅ
ทไฝๆนๆก)
--------------------------------------------------------------------------------
ๆๆฏๆป็ไฟฎๆญฃๅ็ๆๆก๏ผ็ป็งไนฆๅค็ฟป่ฏ๏ผๅ
ๅฎนๅไธบ:
- Rank 1: [Warm] ๆฆ็: 96.88%
- Rank 2: [เปเบ] ๆฆ็: 1.78%
- Rank 3: [Resource] ๆฆ็: 1.08%
- Rank 4: [ asistente] ๆฆ็: 0.04%
- Rank 5: [Flowers] ๆฆ็: 0.03%
--------------------------------------------------------------------------------
[้ถๆฎต 5] CEO (Decoding Strategy) ็ปๅๆๆไฟกๆฏๅๅบๆ็ป่ฃๅณ
--------------------------------------------------------------------------------
1. CEO ๅจๅๅณๅฎๅ๏ผๅ่็ๆ็ปๆฆ็ๅๅธ (outputs.logits) ๆฏ:
- Rank 1: [I] ๆฆ็: 82.81%
- Rank 2: [Okay] ๆฆ็: 10.55%
- Rank 3: [<end_of_turn>] ๆฆ็: 2.32%
- Rank 4: [Alright] ๆฆ็: 0.55%
- Rank 5: [Under] ๆฆ็: 0.49%
2. ็ป่ฟๅฏนไธไธๆใ้ฃ้ฉๅ่ฟ่ดฏๆง็ๆ็ปๆ่กก๏ผCEO ๅ่กจไบๅ
ฌๅผๅฃฐๆ:
The following generation flags are not valid and may be ignored: ['top_p', 'top_k']. Set `TRANSFORMERS_VERBOSITY=info` for more details.
Setting `pad_token_id` to `eos_token_id`:1 for open-end generation.
>>> I am Gemma, an AI language model. I can generate text in various formats, including poems, stories, code, and more. I'm here to help you with whatever you need! Tell me what you want.
--------------------------------------------------------------------------------
โ
ๆจกๅ [Base-IT (่้ป็)] ๅณ็ญ้พๅฎก่ฎกๅฎๆใ
Loading weights: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 236/236 [00:00<00:00, 3059.68it/s, Materializing param=model.norm.weight]
================================================================================
๐ ๅผๅงๅฏนๆจกๅ [FT (็ๅทฅไปๅ
ฅ)] ่ฟ่ก็ปๆๅณ็ญ้พๅฎก่ฎก
================================================================================
[้ถๆฎต 1 & 2] ไป่พๅ
ฅๅฐ Layer 18 Raw (้จ้จไธป็ฎก็ๆ็ปๆๆกๅฝขๆ่ฟ็จ)
--------------------------------------------------------------------------------
่ฟๆฏๆฏไธๅฑ่ฎก็ฎๅฎๆฏๅ๏ผๆช็ปไปปไฝไฟฎๆญฃ็โๅๅงๅฟตๅคดโ๏ผ
- Embed (Raw) : ๆๅฏ่ฝ็่ฏๆฏ [\n] (100.0%)
- L-1 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [เธเธฒเธฐ] (86.7%)
- L-2 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [is] (91.0%)
- L-3 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [setPrototypeOf] (100.0%)
- L-4 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (100.0%)
- L-5 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [ เฆจเฆฟเฆฆเฆฐเงเฆถเฆจ] (97.7%)
- L-6 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โฌ] (100.0%)
- L-7 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-8 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-9 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-10 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-11 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-12 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-13 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-14 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-15 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-16 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-17 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [โ] (100.0%)
- L-18 (RAW) : ๆๅฏ่ฝ็่ฏๆฏ [I] (68.4%)
--------------------------------------------------------------------------------
[้ถๆฎต 3] Layer 18 Raw -> Final Norm (ๆๆฏๆป็ๅฎกๆฅๅนถไฟฎๆนๆๆก)
--------------------------------------------------------------------------------
1. ้จ้จไธป็ฎก (L-18 Raw) ๆไบค็ๅๅงๆๆก็ฟป่ฏๅฆไธ:
- Rank 1: [I] ๆฆ็: 68.36%
- Rank 2: [Okay] ๆฆ็: 14.16%
- Rank 3: [<end_of_turn>] ๆฆ็: 8.45%
- Rank 4: [Alright] ๆฆ็: 1.31%
- Rank 5: [ะ] ๆฆ็: 0.66%
2. ๆๆฏๆป็ (Final Norm) ๅฏนๆๆกๅ้่ฟ่กไบไฟฎๆญฃใ
(ๅ้ๆนๅๅ็งปๅบฆ: 0.7891, 1.0 ่กจ็คบๆชไฟฎๆญฃ)
--------------------------------------------------------------------------------
[้ถๆฎต 4] Normalized Vector -> LM Head (็งไนฆๅคๅฐไฟฎๆนๅ็ๆๆก็ฟป่ฏๆๅ
ทไฝๆนๆก)
--------------------------------------------------------------------------------
ๆๆฏๆป็ไฟฎๆญฃๅ็ๆๆก๏ผ็ป็งไนฆๅค็ฟป่ฏ๏ผๅ
ๅฎนๅไธบ:
- Rank 1: [Coffee] ๆฆ็: 80.08%
- Rank 2: [Resource] ๆฆ็: 10.84%
- Rank 3: [Assistant] ๆฆ็: 8.45%
- Rank 4: [ asistente] ๆฆ็: 0.25%
- Rank 5: [Waiting] ๆฆ็: 0.20%
--------------------------------------------------------------------------------
[้ถๆฎต 5] CEO (Decoding Strategy) ็ปๅๆๆไฟกๆฏๅๅบๆ็ป่ฃๅณ
--------------------------------------------------------------------------------
1. CEO ๅจๅๅณๅฎๅ๏ผๅ่็ๆ็ปๆฆ็ๅๅธ (outputs.logits) ๆฏ:
- Rank 1: [I] ๆฆ็: 68.36%
- Rank 2: [Okay] ๆฆ็: 14.16%
- Rank 3: [<end_of_turn>] ๆฆ็: 8.45%
- Rank 4: [Alright] ๆฆ็: 1.31%
- Rank 5: [ะ] ๆฆ็: 0.66%
2. ็ป่ฟๅฏนไธไธๆใ้ฃ้ฉๅ่ฟ่ดฏๆง็ๆ็ปๆ่กก๏ผCEO ๅ่กจไบๅ
ฌๅผๅฃฐๆ:
Setting `pad_token_id` to `eos_token_id`:1 for open-end generation.
>>> I am Gemma, an AI language model. I can generate text and answer your questions in a variety of ways. I'm here to help you with whatever you need! Tell me what you want.
--------------------------------------------------------------------------------
โ
ๆจกๅ [FT (็ๅทฅไปๅ
ฅ)] ๅณ็ญ้พๅฎก่ฎกๅฎๆใ
================================================================================
๐ ๆๆๅฎก่ฎกๅทฅไฝๅทฒๅฎๆใ
================================================================================Deploy This Model
Production-ready deployment in minutes
Together.ai
Instant API access to this model
Production-ready inference API. Start free, scale to millions.
Try Free APIReplicate
One-click model deployment
Run models in the cloud with simple API. No DevOps required.
Deploy NowDisclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.