Commit Graph

33 Commits

Author SHA1 Message Date
an-lee
de89ae73bf Force alignment for tts audio (#418)
* add originalText as param

* save original text when added from tts speech

* fix player in conversation sheet

* minor fix
2024-03-18 12:55:28 +08:00
an-lee
90f38e9226 Transcription force alignment & more (#416)
* add wavesurfer-provider

* brand new layout for player

* refactor pitch contour

* clean up

* update styl

* refactor

* update layout

* use new layout for video

* refactor

* may select word

* may edit word timestamp

* may toggle multiselect words

* clean code

* improve word region update

* improve layout

* update layout

* add echogarden

* fix test

* use aligned transcription

* fix ipa

* some refactor

* improve code

* implement ipa & translate & lookup

* recording play & share

* fix

* fix post audio

* improve layout

* may delete recording

* may record

* fix video player layout

* fix player in conversation

* render recording along with orignal audio

* may custom create region in recording

* fix float issue when seekTo

* fix recording player

* fix load more recordings

* fix seekTo

* clean up

* refactor pitch contour

* fix some warnings

* upgrade deps

* fix group transcription sentence

* zoom to fit when segment update

* add more hotkeys

* update player layout

* improve style

* play recording overlap audio when comparing

* update echogarden dep

* add recorded mark on transcription

* fix recording pitch contour rendering

* improve recording

* adjust pitch finder params
2024-03-16 19:42:37 +08:00
an-lee
ded7371be6 Feat: integrate deposit (#363)
* create deposit payment

* display recent deposits & improve UI

* support Mixin pay
2024-02-28 15:29:07 +08:00
an-lee
74e061ccac fix messages findOne (#342) 2024-02-22 22:17:46 +08:00
an-lee
8037273549 Feat: add tts conversation (#341)
* may create tts type conversation

* support tts reply

* upgrade deps

* test: e2e for create TTS conversation

* test: e2e for gpt conversation

* test: e2e for from create conversation to add speech audio to library

* refactor use-conversation

* generate speech before create msg in tts conversation

* refactor conversation-shorts

* revert change in 1000-hours

* revert sass dep changed in 1000-hours

* fix CI
2024-02-22 16:00:17 +08:00
an-lee
650a00b6a4 refactor transcode method 2024-02-20 09:00:58 +08:00
an-lee
f75912ff8b Feat use ffmpeg static (#327)
* use ffmpeg static if valid

* transcribe after decoded
2024-02-19 14:12:06 +08:00
an-lee
06f8d32169 Feat improve media player page (#320)
* download transcription from api when transcribed

* add friendly loading panel on audio page

* add loading panel for video page

* fix locale

* remove ubuntu 20.04 for e2e test

* do not auto upload recording
2024-02-19 11:01:52 +08:00
an-lee
57d6efa547 Feat: refactor ffmpeg config (#296)
* bundle ffmpeg wasm in package

* remove ffmpeg discover & add ffmpeg-static

* fix ffprobe-static version

* fix metadata generate

* fix auto transcribe if pending

* remove console.log
2024-02-11 17:21:39 +08:00
an-lee
bc22a5e2b4 Feat: refactor STT service (#294)
* add stt hook interface

* fix crypto exported to browser

* refactor use-transcribe

* may use openai stt

* refactor: remove decprecated codes

* fix undefined method
2024-02-10 19:55:07 +08:00
an-lee
338ef82a1e Fix recording sync (#291)
* delete audio/video/recording in remote

* sync recordings on profile page

* handle recording sync failed
2024-02-09 18:24:36 +08:00
an-lee
a95c247c8c Refactor whisper config (#287)
* may use custom whisper exe

* refactor whisper config code

* display progress when using local whisper transcribe
2024-02-09 17:07:21 +08:00
an-lee
abde169ead Fix openai proxy (#244)
* add create messages in batch

* add use conversation

* update conversation shortcut

* add speech handler

* tts in renderer

* fix speech create
2024-02-02 00:41:23 +08:00
an-lee
51a810fdfd Feat: may setup proxy (#238)
* add https proxy

* remove proxy in renderer

* proxy work for openai request

* use proxyAgent to enable system proxy

* add proxy setting

* tweak proxy setting
2024-02-01 15:33:37 +08:00
an-lee
00cbc8403b Feat: add Enjoy AI as option (#206)
* add enjoyAI as option

* use enjoyai config

* may call enjoyai

* may set default ai engine

* refactor setting context

* refactor preferences

* add warning when openai key not provided

* tweak locale

* update duration for audio/video

* add balance settings

* may select ai role when create conversation

* may forward message from conversation

* tweak ui

* refactor transcribe method

* refactor ai commands to hooks

* fix webapi

* tweak playback rate options

* add playMode, next & prev, ref: #124

* upgrade deps

* may skip whisper model download

* audios/videos default order by updated_At
2024-01-31 00:04:59 +08:00
an-lee
cec9d73bc8 Feat: transcribe from web (#204)
* add transcribe from web

* transcribe from web

* add azure speech ai

* fix azure speech output

* may select stt service

* fix UI

* remove debug code

* lint

* fix default stt service

* tweak

* fix secondsToTimestamp
2024-01-27 00:45:06 +08:00
an-lee
99577c5020 Feat: more login methods (#195)
* add login code api

* use container for webview

* improve browserview

* may login via xuesheng account

* locale

* fix ui
2024-01-25 11:02:26 +08:00
an-lee
3a98bd9f40 Feat: use ffmpeg wasm (#177)
* add ffmepg wasm

* refactor whisper config

* refactor ffmpeg config

* remove depected codes

* refactor model download

* decouple whipser & ffmpeg when transcribe

* do not always toast not found error

* ffmpeg works in renderer

* transcode in renderer & transcript in main

* improve transcode

* refactor

* remove ffmpeg check

* fix whisper config check

* update whisper check

* enable SharedArrayBuffer

* remove ffmpeg setting

* fix UI

* handle ffmpeg not ready

* may create issue when whisper check failed

* upgrade deps

* hide ffmpeg setting if not ready

* hide record button when not in region
2024-01-23 14:39:16 +08:00
an-lee
fe8f14bda7 ask for media access before record (#172) 2024-01-20 10:49:53 +08:00
an-lee
5efc5fc1db Feat: AI commands (#145)
* upgrade deps

* add extract command

* add lookup.command

* update lookup command

* fix locals

* may lookup one by one

* update lookup method

* add translate command

* cache translation by default

* open ai default settings

* use openai config in context

* refactor

* genreate ipa

* update UI

* handle ai generate fail
2024-01-19 16:52:49 +08:00
an-lee
98d8061600 add sample for checking whisper & ffmpeg 2024-01-19 13:52:05 +08:00
an-lee
3f2b6e78e8 fix recording calendar level out of range (#121) 2024-01-15 22:48:30 +08:00
an-lee
b545ea2362 Feat: save waveform as file (#118)
* package rpm

* cache waveform data as file in library

* clear waveform data in db

* fix some css
2024-01-15 16:57:44 +08:00
an-lee
187038c42e Feat: scan ffmpeg command (#116)
* package rpm

* fix model url

* scan/check ffmpeg command

* handle undefined

* add reset settings button

* add ffmpeg install instrunction for mac

* improve landing steps
2024-01-15 14:12:22 +08:00
an-lee
d2510d00cb Feat: more preferences (#106)
* add ffmpeg command check

* may switch language

* tweak
2024-01-14 16:54:15 +08:00
an-lee
0ecaf4bdff may share recording 2024-01-13 15:04:28 +08:00
an-lee
1243076bbb add audio/video & send prompt from share zone 2024-01-13 01:09:40 +08:00
an-lee
3dff4330a1 new post type 2024-01-12 15:47:33 +08:00
an-lee
f9b1c14b4c share audio/video & display post 2024-01-12 00:54:53 +08:00
an-lee
267eee37b9 remove web-api in preload 2024-01-11 17:37:21 +08:00
an-lee
94d4a0a338 add community page 2024-01-11 17:10:00 +08:00
an-lee
39bb9c8f30 add default hotkeys 2024-01-10 21:24:46 +08:00
an-lee
aebd9ee213 add enjoy app 2024-01-09 15:19:32 +08:00