如何使 Mediapipe 姿态估计更快（Python）

Question

如何使 Mediapipe 姿态估计更快（Python）

pythonpython-3.xartificial-intelligencepose-estimationmediapipe

6

我正在为我的游戏制作姿势估计脚本。然而，即使没有fps限制，它的工作速度只有20-30帧每秒，且没有使用到整个CPU。同时，它也没有充分利用整个GPU。有人能帮我吗？

这是播放舞蹈视频时的资源使用情况：https://imgur.com/a/6yI2TWg 以下是我的代码：

import cv2
import mediapipe as mp
import time

inFile = '/dev/video0'

capture = cv2.VideoCapture(inFile)
FramesVideo = int(capture.get(cv2.CAP_PROP_FRAME_COUNT)) # Number of frames inside video
FrameCount = 0 # Currently playing frame
prevTime = 0

# some objects for mediapipe
mpPose = mp.solutions.pose
mpDraw = mp.solutions.drawing_utils
pose = mpPose.Pose()

while True:
    FrameCount += 1
    #read image and convert to rgb
    success, img = capture.read()
    imgRGB = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    
    #process image
    results = pose.process(imgRGB)

    if results.pose_landmarks:
        mpDraw.draw_landmarks(img, results.pose_landmarks, mpPose.POSE_CONNECTIONS)
        #get landmark positions
        landmarks = []
        for id, lm in enumerate(results.pose_landmarks.landmark):
            h, w, c = img.shape 
            cx, cy = int(lm.x * w), int(lm.y * h) 
            cv2.putText(img, str(id), (cx,cy), cv2.FONT_HERSHEY_PLAIN, 1, (255,0,0), 1)
            landmarks.append((cx,cy))
 
    # calculate and print fps
    frameTime = time.time()
    fps = 1/(frameTime-prevTime)
    prevTime = frameTime
    cv2.putText(img, str(int(fps)), (30,50), cv2.FONT_HERSHEY_PLAIN, 3, (255,0,0), 3)

    #show image
    cv2.imshow('Video', img)
    cv2.waitKey(1)
    if FrameCount == FramesVideo-1:
        capture.release()
        cv2.destroyAllWindows()
        break

- Hamster

1个回答

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- MmBaguette · Accepted Answer

将mp.Pose的model_complexity设置为0。

根据文档：

MODEL_COMPLEXITY 姿势标记模型的复杂度：0、1或2。随着模型复杂度的增加，标记的精度以及推理延迟通常会增加。默认为1。

这是我找到的最佳解决方案，也请使用此方法。