Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wav2Lip: Accurately Lip-syncing Videos #2417

Merged
merged 24 commits into from
Sep 27, 2024

Conversation

aleksandr-mokrov
Copy link
Contributor

CVS-152780

Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@aleksandr-mokrov aleksandr-mokrov marked this pull request as ready for review September 26, 2024 19:47
@aleksandr-mokrov aleksandr-mokrov requested review from a team and eaidova and removed request for a team September 26, 2024 19:47
@@ -0,0 +1,326 @@
{
Copy link
Collaborator

@eaidova eaidova Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure that "novel" word is suitable here (model released 3 years ago)


Reply via ReviewNB

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed

@@ -0,0 +1,326 @@
{
Copy link
Collaborator

@eaidova eaidova Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Line #17.    pip_install("-q", "openvino>=2024.3.0")

2024.4.0,

also not sure how contrib and general opencv packages interact with each other (if I right remember contrib includes base and override content, but I may mistaken)


Reply via ReviewNB

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

openvino version is updated. I checked opencv packages, they are installing independently.

@@ -0,0 +1,326 @@
{
Copy link
Collaborator

@eaidova eaidova Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Line #12.    sys.path.append(str(wav2lip_path))

as we discussed sometime ago, it is better to use insert(0) instead to append for avoiding mixing packages in env and too long variables cutting on windows


Reply via ReviewNB

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

@@ -0,0 +1,326 @@
{
Copy link
Collaborator

@eaidova eaidova Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you please show reference audio too? it is also better to include logs for this cells that users can see expected result (or maybe provide some static example of expected result in markdown if you think that using videos makes notebook size too large)


Reply via ReviewNB

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Output is added, they are not too large: 2643KB is size of notebook in total

@@ -0,0 +1,554 @@
from os import listdir, path
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we need separated file for inference? maybe it can be combined into one file with helper?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bbox and ov_inference are merged

img = torch.from_numpy(img).float().to(device)
BB, CC, HH, WW = img.size()

results = net({"x": img.numpy()})
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

slightly strange manipulation from numpy to torch and back without any actions between, ov model accepts torch tensors as well as numpy as input

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

converting to numpy is removed

@@ -0,0 +1,126 @@
from __future__ import print_function
Copy link
Collaborator

@eaidova eaidova Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure that this import required, it is for old pytorch compatibility <3.6) and the same comment, can this file content be combined with other? too much helpers requires download more files for running notebook as standalone and makes code understanding less clear. it makes sense to separate them only for case if some functional may be optional (e.g. quantization utilities, user may skip quantization step and in this case this fucntions will not be used)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed, bbox and ov_inference are merged

- Convert the original model to OpenVINO Intermediate Representation (IR) format
- Compiling models and prepare pipeline
- Interactive inference

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you please put some images/animations related to inference results?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ezgif-5-b334ddc1b2

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This animation was added. In notebook properties previous one was replaced by this

@eaidova eaidova merged commit f74ddef into openvinotoolkit:latest Sep 27, 2024
13 of 14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants