41bb4d6b29
Player settings panel, stats API, and README document how saved and new tracks mix under a per-day Lyria cap. Co-authored-by: Cursor <cursoragent@cursor.com>
176 lines
6.5 KiB
Markdown
176 lines
6.5 KiB
Markdown
# Live Ozan Radio
|
|
|
|
Personal AI radio — **no catalog music, ever**. DeepSeek is the DJ. Google **Lyria 3** composes every track. Taste comes from `settings.json` (and optionally Spotify).
|
|
|
|
Inspired by [Magenta RealTime 2](https://magenta.withgoogle.com/magenta-realtime-2) (live, ~200ms) and [Lyria 3](https://deepmind.google/models/lyria/) (full songs via Gemini API). On Mac you can layer MRT2 for true live improvisation; this repo ships the cross-platform Lyria + DeepSeek stack first.
|
|
|
|
## Stack
|
|
|
|
| Layer | Product | Role |
|
|
|-------|---------|------|
|
|
| DJ brain | DeepSeek (Tinqs proxy or BYOK) | Mood, prompts, chat, variety |
|
|
| Music engine | Google Lyria 3 Pro / Clip | Generate MP3 tracks |
|
|
| Taste | `settings.json` + `taste_seeds.json` | Genres, mood, instruments — DJ reads every request |
|
|
| Taste (optional) | Spotify Web API | Top artists, genres — never plays Spotify |
|
|
| Player | FastAPI + `gateway/player.html` | Stream queue, library, chat, dashboard |
|
|
| Live (optional) | Magenta RealTime 2 | Apple Silicon only — see below |
|
|
|
|
## Player dashboard
|
|
|
|
Open **http://127.0.0.1:8787/player** after starting the server.
|
|
|
|
| Feature | What it does |
|
|
|---------|----------------|
|
|
| **Cost dashboard** | Today's estimated spend, per-track cost, songs generated vs daily cap |
|
|
| **Settings (gear icon)** | Shuffle mode, mix saved + new tracks, new-song chance, max songs/day |
|
|
| **Shuffle mode** | On track end or Skip, picks a random saved song or composes a new one |
|
|
| **Saved songs** | Click any track in the library to play |
|
|
| **DJ chat** | Talk to DeepSeek — requests can trigger new Lyria generations |
|
|
|
|
### Shuffle behaviour
|
|
|
|
When shuffle is on (default):
|
|
|
|
1. If you have saved songs **and** daily quota remains, each next track has a `new_song_chance` (default 35%) of being freshly composed.
|
|
2. Otherwise a random saved song plays (never the same track twice in a row if alternatives exist).
|
|
3. If the library is empty, it generates until the daily cap is hit.
|
|
|
|
Daily generation stats live in `songs/stats.json` (gitignored, local runtime only).
|
|
|
|
## Taste (`settings.json`)
|
|
|
|
Edit `settings.json` at the repo root. The DJ reloads it on every generate and chat.
|
|
|
|
```json
|
|
{
|
|
"taste": { "summary": "...", "genres": [], "mood": [], "instruments": [] },
|
|
"playback": {
|
|
"shuffle": true,
|
|
"mix_existing_and_new": true,
|
|
"new_song_chance": 0.35
|
|
},
|
|
"limits": { "max_new_songs_per_day": 10 },
|
|
"costs": {
|
|
"lyria_pro_usd": 0.08,
|
|
"lyria_clip_usd": 0.04,
|
|
"deepseek_per_track_usd": 0.002
|
|
}
|
|
}
|
|
```
|
|
|
|
Default taste profile: **ethnic world dubtronica** (global roots + dub space + electronic groove).
|
|
|
|
The player settings panel PATCHes `playback` and `limits` via `/api/settings` and writes back to this file.
|
|
|
|
## Saved songs
|
|
|
|
Every track is written to `./songs/` and **committed via Git LFS** (audio) + plain git (metadata):
|
|
|
|
| File | Storage | Contents |
|
|
|------|---------|----------|
|
|
| `{id}_{title}.mp3` | LFS | Audio |
|
|
| `{id}.meta.json` | git | Title, mood, DJ line, prompt, lyrics, timestamp |
|
|
| `manifest.json` | gitignored | Runtime queue index |
|
|
|
|
Browse in the player under **Saved songs**, or `GET /api/songs`. After clone: `git lfs install` then `git lfs pull`.
|
|
|
|
## Quick start (Forge / Windows)
|
|
|
|
```powershell
|
|
cd live-ozan-radio
|
|
python -m venv .venv
|
|
.venv\Scripts\activate
|
|
pip install -e .
|
|
copy .env.example .env
|
|
# Fill GEMINI_API_KEY + DEEPSEEK_API_KEY (and Spotify if you have them)
|
|
|
|
$env:DEEPSEEK_BASE_URL = "https://api.deepseek.com/v1"
|
|
python -m ozan_radio serve
|
|
# Open http://127.0.0.1:8787/player
|
|
```
|
|
|
|
One-shot track (no server):
|
|
|
|
```powershell
|
|
python -m ozan_radio generate
|
|
```
|
|
|
|
## Environment
|
|
|
|
| Variable | Required | Notes |
|
|
|----------|----------|-------|
|
|
| `GEMINI_API_KEY` | Yes | [Google AI Studio](https://aistudio.google.com/apikey) — Lyria 3 |
|
|
| `DEEPSEEK_API_KEY` | Yes | Tinqs proxy token or DeepSeek direct |
|
|
| `DEEPSEEK_BASE_URL` | No | Default `https://api.deepseek.com/v1` |
|
|
| `SPOTIFY_*` | No | Refresh token flow — taste only |
|
|
| `LYRIA_MODEL` | No | `lyria-3-pro-preview` (default) or `lyria-3-clip-preview` |
|
|
| `RADIO_OUTPUT_DIR` | No | Default `./songs` |
|
|
|
|
### Cost estimates (defaults)
|
|
|
|
| Model | ~USD / track |
|
|
|-------|----------------|
|
|
| Lyria 3 Pro + DeepSeek | ~$0.082 |
|
|
| Lyria 3 Clip | ~$0.04 |
|
|
|
|
At the default cap of 10 new songs/day with Lyria Pro, projected max spend is **~$0.82/day**. Adjust `costs` and `limits` in `settings.json` or the player settings panel.
|
|
|
|
### Spotify setup (optional taste)
|
|
|
|
1. Create an app at [Spotify Developer Dashboard](https://developer.spotify.com/dashboard).
|
|
2. Add redirect URI `http://127.0.0.1:8888/callback`.
|
|
3. Complete OAuth once to obtain a refresh token (scope: `user-top-read`).
|
|
4. Paste `SPOTIFY_CLIENT_ID`, `SPOTIFY_CLIENT_SECRET`, `SPOTIFY_REFRESH_TOKEN` into `.env`.
|
|
|
|
If Spotify is not configured, the DJ uses `settings.json` + `taste_seeds.json`.
|
|
|
|
## API
|
|
|
|
| Method | Path | Description |
|
|
|--------|------|-------------|
|
|
| GET | `/api/now` | Current track metadata |
|
|
| GET | `/api/queue` | Full queue |
|
|
| GET | `/api/stats` | Dashboard: today's spend, quota, cost estimates |
|
|
| GET | `/api/settings` | Playback, limits, costs |
|
|
| PATCH | `/api/settings` | Update shuffle, daily cap, etc. |
|
|
| POST | `/api/generate` | DJ plans + Lyria renders next track |
|
|
| POST | `/api/shuffle/next` | Smart next: library shuffle or new generation |
|
|
| POST | `/api/skip` | Advance (uses shuffle when enabled) |
|
|
| GET/POST | `/api/chat` | DJ chat log and messages |
|
|
| GET | `/api/songs` | Saved library |
|
|
| POST | `/api/songs/{id}/play` | Play a saved track |
|
|
| GET | `/stream/{file}` | MP3 stream |
|
|
| GET | `/player` | Web UI |
|
|
|
|
## Magenta RealTime 2 (optional live layer)
|
|
|
|
On **Apple Silicon** (Kraken), install [magenta-rt](https://github.com/magenta/magenta-realtime) for sub-second live generation:
|
|
|
|
```bash
|
|
uv pip install "magenta-rt[mlx]"
|
|
mrt models init && mrt models download
|
|
mrt mlx generate --prompt "disco funk" --duration 4.0 --model=mrt2_small
|
|
```
|
|
|
|
Wire MRT2 as a bridge between tracks or as a live “bed” under the Lyria queue — PRs welcome.
|
|
|
|
## Repo
|
|
|
|
Public on Git Studio: **https://tinqs.com/tinqs/live-radio**
|
|
|
|
```bash
|
|
git clone git@ssh.tinqs.com:tinqs/live-radio.git
|
|
cd live-radio
|
|
git lfs install && git lfs pull
|
|
```
|
|
|
|
Static player preview: `https://tinqs.com/tinqs/live-radio/src/branch/main/gateway/player.html` (shell only — audio streams from your running server).
|
|
|
|
## Agent usage
|
|
|
|
DeepSeek (Pi, Cursor, Claude Code) can operate the station via HTTP or the skill in `.cursor/skills/ozan-radio/SKILL.md`.
|
|
|
|
## License
|
|
|
|
Apache 2.0 — same spirit as Magenta RealTime 2 open weights.
|