Homework 7
Computer Vision, Spring 2024
Due Date: April 26, 2024
Total Points: 20
This homework contains two programming challenges. All submissions are due at
midnight on April 26, 2024, and should be submitted according to the instructions
in the document “Guidelines for Programming Assignments.pdf”.
runHw7.py will be your main interface for executing and testing your code.
Parameters for the different programs or unit tests can also be set in that file.
Before submission, make sure you can run all your programs with the command
python runHw7.py with no errors.
The numpy package is optimized for operations involving matrices and
vectors. Avoid using loops (e.g., for, while) whenever possible—looping can
result in long running code. Instead, you should “vectorize” loops to optimize
your code for performance. In many cases, vectorization also results in more
compact code (fewer lines to write!).
Challenge 1: In this challenge you are asked to develop an optical flow system. You
are given a sequence of 6 images (flow1.png – flow6.png) of a dynamic scene. Your
task is to develop an algorithm that computers optical flow estimates at each image
point using the 5 pairs (1&2, 2&3, 3&4, 4&5, 5&6) of consecutive images.
Optical flow estimates can be computed using the optical flow constraint equation
and Lucas-Kanade solution presented in class. For smooth motions, this algorithm
should produce robust flow estimates. However, given that the six images were
taken with fairly large time intervals in between consecutive images, the brightness
and temporal derivatives used by the algorithm are expected to be unreliable.
Therefore, you are advised to implement a different (and simpler) optical flow
algorithm. Given two consecutive images (say 1 and 2), establish correspondences
between points in the two images using template matching. For each image point in
the first image, take a small window (say 7x7) around the point and use it as the
template to find the same point in the second image. While searching for the
corresponding point in the second image, you can confine the search to a small
window around the pixel in the second image that has the same coordinates as the
2
one in the first image. The center of the 7x7 image window in the second image that
is maximally correlated with the 7x7 window in the first image is assumed to be the
corresponding point. The vector between two corresponding points is the optical
flow (u,v).
Write a program computeFlow that computes optical flow between two gray-level
images, and produces the optical flow vector field as a “needle map” of a given
resolution, overlaid on the first of the two images.
result = computeFlow(img1, img2, win_radius, template_radius,
grid_MN)
You need to choose a value for the grid spacing that gives good results without
taking excessively long to compute. (6 points)
For debugging purposes use the test case in debug1a. In this synthetic case, the flow
field consists of horizontal vectors of the same magnitude (translational motion
parallel to the image plane). Note that in the real case, foreshortening effects,
occlusions, and reflectance variations (as well as noise) complicate the result.
(2 point)
Challenge 2: Your task is to develop a vision system that tracks the location of an
object across video frames. Object tracking is a challenging problem since an
object’s appearance, pose and scale tend to change as time progresses. In class we
have discussed three popular tracking methods: template-based tracking,
histogram-based tracking and detection-based tracking. In this challenge, we will
assume the color distribution of an object stays relatively constant over time.
Therefore, we will track an object using its color histogram.
A color histogram describes the color distribution of a color image. The color
histogram that you will need to compute is defined as follows. Each bin of the color
histogram represents a range of colors, and the number of votes in each bin
indicates the number of pixels that have the colors within the corresponding color
range.
Be careful, in the initialization of your program, you should generate a color map
from the region of interest (ROI), and compute all subsequent color histograms
based on the same color map. It is only meaningful to compare two histograms
computed based on the same color map. Use the provided function chooseTarget
to drag a rectangle around a tracking target.
3
Write a program named trackingTester that estimates the location of an object in
video frames.
trackingTester(data_params, tracking_params)
trackingTester should draw a box around the target in each video frame, and
save all the annotated video frames as PNGs into a subfolder given in
data_params.out_dir.
After generating the annotated video frames, use the provided function
generateVideo to create a video file containing all the frames.
(12 points)
Include all the code you have written, as well as the resulting video files, but
DO NOT include the three tracking datasets and the individual output frames
in your submission.
请加QQ:99515681 邮箱:99515681@qq.com WX:codinghelp
- 山东双嘉家庭教育走进历城一中:开展心理讲座赋能青春梦想
- 春天里·议国是|共担文化使命——张传斌
- 追寻共同记忆 乔治·何克亲属来沪祭奠
- 三部门新政力推智能家居 居然智家数智化转型迎利好东风
- 海棠情缘,京津和鸣 菜百股份亮相京津两市海棠花节
- 福布斯首次举办餐饮领军人物评选,旺旺蔡旺庭凭借邦德咖啡馆布局获奖
- 供应链峰会赋能中国制造业,点燃增长新引擎 深圳立合供应链技术
- 越秀北京 梧桐星宸|当产品力遇见年轻力,大城生活一步达成
- 连续2年登榜,安保医疗解锁呼吸机核心技术!
- 航空盛宴!沃飞长空航展展示AE200最新技术成果
- 老牌保险公司|泰康人寿怎么样?实力真的靠谱吗?
- 创维光伏:用“光储充云”一体化解决方案助力低碳发展
- 第89届CMEF展会:AI引领产业升级,国产医疗设备获海外客户青睐
- 辣味不简单!三养火辣酱再出奶油味新品让你“一蘸到底”
- 周星驰新片《少林女足》在台湾省举办海选,吸引了不少素人和足球爱好者前来参加