Skip to content

Help setting speed using CLI with multispeech #1112

Closed
@sv-ctrl

Description

@sv-ctrl

Checks

  • This template is only for usage issues encountered.
  • I have thoroughly reviewed the project documentation but couldn't find information to solve my problem.
  • I have searched for existing issues, including closed ones, and couldn't find a solution.
  • I am using English to submit this issue to facilitate community communication.

Environment Details

set up using the instructions in the readme. windows 11.

Steps to Reproduce

using multi-example with CLI and .toml file

✔️ Expected Behavior

No response

❌ Actual Behavior

So first off, I love this repo and everyone involved in it. It is excellent. The documentation is excellent, and the gradio app is so easy to use.

HOWEVER. Having gone through the documentation and the apps I am having an issue that I hope can be easily solved, but the closest answer I found in the discussions is "see gradio_app advanced settings".

I am using the CLI to do multispeech, almost exactly like the example. Where I'm struggling is I need to set the speed for various speech types to be different. An example would be Country mouse speed=0.5 and Town mouse speed =0.8.

Is this possible to pass to the main infer.py with a .toml file? How would you go about this?

Additionally, if I were to fine-tune models for different speakers, could we use a different model for each speaker? Is that possible or not in the current environment?

Thanks for any help you can give!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requesthelp wantedExtra attention is needed

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions