Merge pull request #52 from google-ai-edge:smilingday-patch-1

copybara-github · copybara-github · commit b16e7a9e5e57 · 2026-05-20T08:30:17.000-07:00
PiperOrigin-RevId: 918466013
diff --git a/README.md b/README.md
@@ -10,6 +10,10 @@ platforms (desktop, mobile, or cloud).
 [Common commands](#-common-commands) ｜ 📓 [Try Colab](#-try-colab) | 🌟
 [Quick demos](#-quick-demos) | 🤖 [Use in coding agent](#-use-in-coding-agent)
 
+> [!NOTE] LiteRT CLI is still an early preview under active development, so it has
+> limited platform and feature support and may contain bugs. We appreciate your
+> patience and feedback to help us improve it. Issues and PRs are welcome!
+
 LiteRT CLI is built on top of [Google AI Edge](https://ai.google.dev/edge)
 stacks, including [LiteRT](https://github.com/google-ai-edge/LiteRT),
 [LiteRT-LM](https://github.com/google-ai-edge/LiteRT-LM),
@@ -18,10 +22,6 @@ stacks, including [LiteRT](https://github.com/google-ai-edge/LiteRT),
 [AI Edge Portal](https://ai.google.dev/edge/ai-edge-portal), and
 [Model Explorer](https://ai.google.dev/edge/model-explorer).
 
-> [!NOTE] It's still an early preview under active development, thus has limited
-> platform and feature support, plus possible bugs. We appreciate your patience
-> and feedback to help us improve it. Welcome issues and PRs!
-
 --------------------------------------------------------------------------------
 
 ## 🚀 Installation
@@ -343,7 +343,34 @@ litert benchmark model.tflite --gcp --device "pixel 7" --gcp-project "your-gcp-p
 litert benchmark model.tflite --gcp --devices "pixel 7, sm-s931u1" --gpu
 ```
 
-### 7. Visualize a model's architecture
+### 7. Run and benchmark a generative LLM model using LiteRT-LM CLI
+
+The `litert lm` command uses `litert-lm`, so you can run the same commands with
+either one. For example, `litert lm run` and `litert-lm run`, or
+`litert lm benchmark` and `litert-lm benchmark`, achieve the same results.
+
+Please follow the
+[LiteRT-LM CLI guide](https://ai.google.dev/edge/litert-lm/cli) for detailed
+instructions.
+
+```bash
+# Run a generative LLM, loading the model from Hugging Face
+litert lm run \
+  --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
+  gemma-4-E2B-it.litertlm \
+  --prompt="What is the capital of France?"
+
+# Or load from a local LLM model file
+litert lm run ./my_model.litertlm
+
+# Example with a custom prompt
+litert lm run ./my_model.litertlm --prompt "Hello, how are you?"
+
+# Benchmark a generative LLM model
+litert lm benchmark ./my_model.litertlm
+```
+
+### 8. Visualize a model's architecture
 
 ```bash
 # Open in Model Explorer graph
@@ -353,7 +380,7 @@ litert visualize model.tflite
 litert visualize --stop-all
 ```
 
-### 8. Import a local model
+### 9. Import a local model
 
 ```bash
 # Import a local file into the centralized cache
@@ -363,7 +390,7 @@ litert import my_model.tflite --model-ref my_model
 litert import ./my_model_dir --model-ref my_model --hf-id my_org_name/my_model
 ```
 
-### 9. List managed models
+### 10. List managed models
 
 ```bash
 # List all managed models
@@ -373,40 +400,12 @@ litert list
 litert list my_model
 ```
 
-### 10. Delete a managed model
+### 11. Delete a managed model
 
 ```bash
 # Delete a model from cache
 litert delete my_model
 ```
-
-### 11. Run and benchmark a generative LLM model using LiteRT-LM CLI
-
-`litert lm` command will utlitize `litert-lm`, and you can use the same command
-with `litert-lm`, for example, both `litert lm run` and `litert-lm run` or
-`litert lm benchmark` and `litert-lm benchmark` achieve the same results.
-
-Please follow the
-[LiteRT-LM CLI guide](https://ai.google.dev/edge/litert-lm/cli) for detailed
-instructions.
-
-```bash
-# Run a generative LLM model, and load from hugging face
-litert lm run  \
-  --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
-  gemma-4-E2B-it.litertlm \
-  --prompt="What is the capital of France?"
-
-# Or load from a local LLM model file
-litert lm run ./my_model.litertlm
-
-# Example with a custom prompt
-litert lm run ./my_model.litertlm --prompt "Hello, how are you?"
-
-# Benchmark a generative LLM model
-litert lm benchmark ./my_model.litertlm
-```
-
 ### 12. Clean up all caches
 
 ```bash