doc: Clean up README.md a bit

2025-03-06 22:18:32 -08:00 · 2025-03-06 22:18:32 -08:00 · 5ca70b2933
commit 5ca70b2933
parent 87c2469c65
1 changed files with 15 additions and 29 deletions
--- a/README.md
+++ b/README.md
@ -2,18 +2,15 @@

 AI is Really Cool

-NOTE: If running on an Intel Arc A series graphics processor, fp64 is not supported and may need to either be emulated or have the model quantized.
-
 This project provides a simple IRC chat client. It runs the neuralchat model, enhanced with a little bit of RAG to fetch news RSS feeds.

 Internally, it is built using PyTorch 2.6 and the Intel IPEX/LLM.

+NOTE: If running on an Intel Arc A series graphics processor, fp64 is not supported and may need to either be emulated or have the model quantized. It has been a while since I've had an A series GPU to test on, so if you run into problems please file an [issue](https://github.com/jketreno/airc/issues)--I have some routines I can put in, but don't have a way to test them. 
+
 # Installation

-This project uses docker containers to build. As this was originally
-written to work on an Intel Arc B580 (Battlemage), it requires a
-kernel that supports that hardware, such as the one documented
-at [Intel Graphics Preview](https://github.com/canonical/intel-graphics-preview), which runs in Ubuntu Oracular (24.10)..
+This project uses docker containers to build. As this was originally written to work on an Intel Arc B580 (Battlemage), it requires a kernel that supports that hardware, such as the one documented at [Intel Graphics Preview](https://github.com/canonical/intel-graphics-preview), which runs in Ubuntu Oracular (24.10)..

 NOTE: You need 'docker compose' installed. See [Install Docker Engine on Ubuntu](https://docs.docker.com/engine/install/ubuntu/)

@ -21,8 +18,7 @@ NOTE: You need 'docker compose' installed. See [Install Docker Engine on Ubuntu]

 https://www.intel.com/content/www/us/en/support/articles/000093216/graphics/processor-graphics.html

-The A- and B-series discrete GPUs do not support SR-IOV, required for
-the GPU partitioning that Microsoft Windows uses in order to support GPU acceleration in WSL.
+The A- and B-series discrete GPUs do not support SR-IOV, required for the GPU partitioning that Microsoft Windows uses in order to support GPU acceleration in WSL.

 ## Building

@ -37,9 +33,7 @@ docker compose build

 ## Running

-In order to download the models, you need to have a Hugging Face
-token. See https://huggingface.co/settings/tokens for information
-on obtaining a token.
+In order to download the models, you need to have a Hugging Face token. See https://huggingface.co/settings/tokens for information on obtaining a token.

 Edit .env to add the following:

@ -47,20 +41,17 @@ Edit .env to add the following:
 HF_ACCESS_TOKEN=<access token from huggingface>
 ```

-NOTE: Models downloaded by most examples will be placed in the
-./cache directory, which is bind mounted to the container.
+NOTE: Models downloaded by most examples will be placed in the ./cache directory, which is bind mounted to the container.

 ### AIRC

-To launch the airc shell interactively, with the pytorch 2.6
-environment loaded, use the default entrypoint to launch a shell:
+To launch the airc shell interactively, with the pytorch 2.6 environment loaded, use the default entrypoint to launch a shell:

 ```bash
 docker compose run --rm airc shell
 ```

-Once in the shell, you can then launch the model-server.py and then
-the airc.py client:
+Once in the shell, you can then launch the model-server.py and then the airc.py client:

 ```bash
 docker compose run --rm airc shell
@ -68,12 +59,9 @@ src/airc.py --ai-server=http://localhost:5000 &
 src/model-server.py
 ```

-By default, src/airc.py will connect to irc.libera.chat on the airc-test
-channel. See `python src/airc.py --help` for options.
+By default, src/airc.py will connect to irc.libera.chat on the airc-test channel. See `python src/airc.py --help` for options.

-By separating the model-server into its own process, you can develop
-and tweak the chat backend without losing the IRC connection established
-by airc.
+By separating the model-server into its own process, you can develop and tweak the chat backend without losing the IRC connection established by airc.

 ### Jupyter

@ -81,15 +69,13 @@ by airc.
 docker compose up jupyter -d
 ```

-The default port for inbound connections is 8888 (see docker-compose.yml).
-$(pwd)/jupyter is bind mounted to /opt/juypter in the container, which is where notebooks will be saved by default.
+The default port for inbound connections is 8888 (see docker-compose.yml). $(pwd)/jupyter is bind mounted to /opt/juypter in the container, which is where notebooks will be saved by default.

 To access the jupyter notebook, go to `https://localhost:8888/jupyter`.

 ### Monitoring

-You can run `ze-monitor` within the launched containers to monitor
-GPU usage.
+You can run `ze-monitor` within the launched containers to monitor GPU usage.

 ```bash
 containers=($(docker ps --filter "ancestor=airc" --format "{{.ID}}"))
@ -111,9 +97,9 @@ Device 1: 8086:A780 (Intel(R) UHD Graphics 770)
 Device 2: 8086:E20B (Intel(R) Graphics [0xe20b])
 ```

-You can then launch ze-monitor in that container specifying 
-the device you wish to monitor:
+You can then launch ze-monitor in that container specifying  the device you wish to monitor:

 ```
-docker exec -it 5317c503e771 ze-monitor --device 2
+containers=($(docker ps --filter "ancestor=airc" --format "{{.ID}}"))
+docker exec -it ${containers[0]} ze-monitor --device 2
 ```