Commit Graph

71 Commits

Author SHA1 Message Date
an-lee
49dabc89a3 Update constants from api (#607)
* fix caption ipa display

* fetch gpt/tts providers from API

* fetch remote gpt presets

* update constants

* fix conversation save

* refactor ipa convert

* fetch ipa mapping from api

* fix ipa mark

* fix constant

* validate camdict pron audio src
2024-05-14 20:37:51 +08:00
an-lee
156222f28c Feat: may download segment (#601)
* may download segment audio

* display original audio with the recording in post
2024-05-13 10:25:28 +08:00
an-lee
0e8de4881c Feat: lookup in context menu (#595)
* add lookup widget

* add lookup result

* refactor

* refactor

* add translate widget

* make translate widget work

* refactor locales

* typo

* remove deprecated component

* refactor lookup prompt
2024-05-10 11:16:38 +08:00
an-lee
69a6f721ca Feat summarize audio topic (#594)
* refactor ai commands

* fix json command

* fix extract story command

* may summarize topic for audio
2024-05-09 10:54:11 +08:00
an-lee
5436b2038c fix active region toggle 2024-05-08 16:23:08 +08:00
an-lee
8799f5d479 Feat: may update transcription (#592)
* may update transcription

* update locales & ui

* transcription post process should not break transcription
2024-05-08 16:16:14 +08:00
an-lee
5e56235bde auto stop recording in one minute (#568) 2024-04-26 16:32:37 +08:00
an-lee
0644c3bbd7 Feat: make notes on caption (#544)
* add segment model

* add note model

* db handle segment & note

* add notes & segments handler

* refactor media caption components

* segment & note create

* fix type

* update note column & may sync

* display selected words for note

* refactor selected words

* auto select words when editing note

* refactor

* refactor caption component

* display notes

* refactor notes components

* fix

* refactor segment & notes into context

* destroy note

* update locale

* fix caption switch issue

* fix layout

* refactor caption layout

* remove deprecated code

* may share note

* improve UI

* fix notes list auto update after created

* remove console.log

* add notes page

* refactor note parameters

* refactor components

* mark note on transcription

* handle no notes

* improve style

* improve style

* show context menu on selection text

* fix utils
2024-04-26 15:05:36 +08:00
an-lee
e9c043ce95 Fix hotkeys (#539)
* fix compare hotkey in sm layout
2024-04-19 10:48:12 +08:00
an-lee
8947b66007 upgrade echogarden dep (#535)
* upgrade echogarden dep

* avoid duplicate render after recording

* update trim threshold when compare recording

* update recording trim threshold when saving
2024-04-17 10:37:30 +08:00
an-lee
093302a771 Fix style (#534)
* fix conversation textarea style

* fix style in dark mode ref #530
2024-04-17 08:58:20 +08:00
an-lee
955571e5d2 Feat copy caption with ipa & refactor some style (#516)
* refactor sidebar

* fix vocabulary style

* fix conversation page in dark mode

* remove focus-visible:ring

* copy caption with ipa
2024-04-12 12:43:55 +08:00
an-lee
ac39ccaf6f Improve layout (#510)
* fix assessment layout

* improve player layout

* refactor sidebar

* default system theme

* may toggle theme

* fix calendar in dark theme

* fix style in dark mode

* improve player layout
2024-04-11 16:03:17 +08:00
slarkvan
5b87d218ac Feat/custom hotkeys (#496)
* feat: 2024-04-07 15:27:52 - custom hotkeys

* feat: 2024-04-07 15:52:13 - add custom compare shortcuts

* feat: 2024-04-09 10:17:23 - Modify the code according to the code review suggestions and optimize the experience.

---------

Co-authored-by: more.tai <more.tai@huolala.cn>
2024-04-09 14:52:51 +08:00
an-lee
1a4df1426b ensure recording is synced before share (#492) 2024-04-07 14:51:32 +08:00
Harry
096a7395b2 Feat: add hotkey for compare function (#483)
* add hotkey for compare function

* remove duplicate hotkeys

* add id="media-compare-button" to small layout
2024-04-04 12:12:23 +08:00
an-lee
f0f4319044 Refactor transcription (#476)
* word-level timestamp is not needed for alignment

* remove deprecated code

* fix error when stop recording
2024-04-02 14:03:02 +08:00
an-lee
265429a24e Feat: denoise recording & clean code (#473)
* denoise recording before saved

* Refactor audio processing and recording logic

* Remove unused code

* use echogarden to transcode

* remove ffmpeg-wasm

* add echogarden decode

* remove deprecated code

* ensure use posix path

* refactor echogarden transcode

* refactor recording denoise

* clean code

* expose align error in toast

* remove unused code
2024-04-02 11:10:19 +08:00
an-lee
85989e025f fix: avoid seek before decoded 2024-04-01 10:45:01 +08:00
an-lee
e777a8e082 fix up missing recording buttons in sm layout (#459) 2024-03-29 16:43:12 +08:00
an-lee
c55f05f721 fix style 2024-03-29 09:04:23 +08:00
an-lee
98d3fa4b99 fix dash mark in transcript 2024-03-29 06:44:56 +08:00
an-lee
2bea14fbdf handle undefined 2024-03-28 16:46:49 +08:00
an-lee
bdc3865b9d try to fix undefined issue 2024-03-28 16:40:47 +08:00
an-lee
da567f44c2 fix Infinity issue 2024-03-28 16:28:52 +08:00
an-lee
fc82286f61 handle undefined 2024-03-28 16:19:40 +08:00
an-lee
daa1c3a995 fix translation cached (#455) 2024-03-28 14:49:04 +08:00
an-lee
e63d77cd82 Fix layout (#454)
* player layout autosize

* fix video layout

* improve style
2024-03-28 10:35:32 +08:00
an-lee
63ce2206e1 Fix bugs (#436)
* fix recording style

* refine prompt
2024-03-25 10:02:07 +08:00
an-lee
3600895a60 Feat: add camdict (#435)
* add camdict db & logic

* refactor camdict

* refactor media caption

* display camdict result
2024-03-23 17:46:26 +08:00
an-lee
03d081592c Feat: improve player caption (#433)
* update ipa mapping

* refactor player caption layout
2024-03-22 10:20:13 +08:00
an-lee
8fc8905afc Feat: may download files(recording/audio/video/speech) (#431)
* may download recording

* may download recording/audio/video/speech
2024-03-22 08:44:48 +08:00
an-lee
1388c120b7 fix caption translate & analysis 2024-03-21 11:28:43 +08:00
an-lee
6dac70daa4 Feat: improve player (#429)
* tune pitch params

* may generate meaning groups

* auto group words

* update caption style

* refactor caption

* add analysis

* use conversation to analyze

* fix caption tabs

* add command component
2024-03-21 11:10:00 +08:00
an-lee
f8b3e2a15d Improve layout (#428)
* add Source Code Pro for IPA display

* auto fit player width

* update IPA mapping

* fix caption render

* pre-process transcription timeline for - and %

* tweak

* fix video player

* fix locale
2024-03-19 15:41:39 +08:00
an-lee
3b770eaf3e Feat: handle decode errors (#425)
* show decode error

* handle loading error in post audio/recording

* tweak

* upgrade deps

* fix logout

* update build-enjoy-app.yml
2024-03-18 18:53:16 +08:00
an-lee
251eada86b fix transcription 2024-03-18 16:27:52 +08:00
an-lee
e139899885 fix ipa convert 2024-03-18 14:31:00 +08:00
an-lee
3541dd9cbf fix ipa display 2024-03-18 13:39:18 +08:00
an-lee
2a766cc9c6 a little pause between loops 2024-03-18 13:38:30 +08:00
an-lee
0f10f3d0e9 ipa convert 2024-03-18 13:17:08 +08:00
an-lee
de89ae73bf Force alignment for tts audio (#418)
* add originalText as param

* save original text when added from tts speech

* fix player in conversation sheet

* minor fix
2024-03-18 12:55:28 +08:00
an-lee
90f38e9226 Transcription force alignment & more (#416)
* add wavesurfer-provider

* brand new layout for player

* refactor pitch contour

* clean up

* update style

* refactor

* update layout

* use new layout for video

* refactor

* may select word

* may edit word timestamp

* may toggle multiselect words

* clean code

* improve word region update

* improve layout

* update layout

* add echogarden

* fix test

* use aligned transcription

* fix ipa

* some refactor

* improve code

* implement ipa & translate & lookup

* recording play & share

* fix

* fix post audio

* improve layout

* may delete recording

* may record

* fix video player layout

* fix player in conversation

* render recording along with original audio

* may custom create region in recording

* fix float issue when seekTo

* fix recording player

* fix load more recordings

* fix seekTo

* clean up

* refactor pitch contour

* fix some warnings

* upgrade deps

* fix group transcription sentence

* zoom to fit when segment update

* add more hotkeys

* update player layout

* improve style

* play recording overlap audio when comparing

* update echogarden dep

* add recorded mark on transcription

* fix recording pitch contour rendering

* improve recording

* adjust pitch finder params
2024-03-16 19:42:37 +08:00
an-lee
8037273549 Feat: add tts conversation (#341)
* may create tts type conversation

* support tts reply

* upgrade deps

* test: e2e for create TTS conversation

* test: e2e for gpt conversation

* test: e2e from creating a conversation to adding speech audio to library

* refactor use-conversation

* generate speech before create msg in tts conversation

* refactor conversation-shorts

* revert change in 1000-hours

* revert sass dep changed in 1000-hours

* fix CI
2024-02-22 16:00:17 +08:00
an-lee
bb06c9e191 tweak media player control 2024-02-20 06:17:52 +08:00
an-lee
06f8d32169 Feat improve media player page (#320)
* download transcription from api when transcribed

* add friendly loading panel on audio page

* add loading panel for video page

* fix locale

* remove ubuntu 20.04 for e2e test

* do not auto upload recording
2024-02-19 11:01:52 +08:00
an-lee
57d6efa547 Feat: refactor ffmpeg config (#296)
* bundle ffmpeg wasm in package

* remove ffmpeg discover & add ffmpeg-static

* fix ffprobe-static version

* fix metadata generate

* fix auto transcribe if pending

* remove console.log
2024-02-11 17:21:39 +08:00
an-lee
bc22a5e2b4 Feat: refactor STT service (#294)
* add stt hook interface

* fix crypto exported to browser

* refactor use-transcribe

* may use openai stt

* refactor: remove deprecated code

* fix undefined method
2024-02-10 19:55:07 +08:00
an-lee
a95c247c8c Refactor whisper config (#287)
* may use custom whisper exe

* refactor whisper config code

* display progress when using local whisper transcribe
2024-02-09 17:07:21 +08:00
an-lee
37cb3c7fe8 Improve player controls & assessment display (#275)
* zoom to fit as default

* display phoneme score in assessment
2024-02-06 23:59:19 +08:00