Homework 7
Computer Vision, Spring 2024
Due Date: April 26, 2024
Total Points: 20
This homework contains two programming challenges. All submissions are due at
midnight on April 26, 2024, and should be submitted according to the instructions
in the document “Guidelines for Programming Assignments.pdf”.
runHw7.py will be your main interface for executing and testing your code.
Parameters for the different programs or unit tests can also be set in that file.
Before submission, make sure you can run all your programs with the command
python runHw7.py with no errors.
The numpy package is optimized for operations involving matrices and
vectors. Avoid using loops (e.g., for, while) whenever possible, since looping can
result in long-running code. Instead, you should “vectorize” loops to optimize
your code for performance. In many cases, vectorization also results in more
compact code (fewer lines to write!).
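As an illustration of the vectorization advice above, the following sketch computes the sum of squared differences between two small image windows twice: once with explicit loops and once with whole-array numpy operations. The function names ssd_loop and ssd_vectorized are hypothetical and are not part of the provided code.

```python
import numpy as np

def ssd_loop(a, b):
    """Sum of squared differences using explicit Python loops (slow)."""
    total = 0.0
    for i in range(a.shape[0]):
        for j in range(a.shape[1]):
            d = float(a[i, j]) - float(b[i, j])
            total += d * d
    return total

def ssd_vectorized(a, b):
    """Same quantity computed with whole-array numpy operations."""
    # Convert to float first so uint8 subtraction cannot wrap around.
    d = a.astype(np.float64) - b.astype(np.float64)
    return float(np.sum(d * d))

# Two random 7x7 gray-level windows for a quick comparison.
rng = np.random.default_rng(0)
a = rng.integers(0, 256, size=(7, 7)).astype(np.uint8)
b = rng.integers(0, 256, size=(7, 7)).astype(np.uint8)
same = np.isclose(ssd_loop(a, b), ssd_vectorized(a, b))
```

Both versions return the same value; the vectorized one avoids the per-pixel Python overhead and is shorter.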
Challenge 1: In this challenge you are asked to develop an optical flow system. You
are given a sequence of 6 images (flow1.png – flow6.png) of a dynamic scene. Your
task is to develop an algorithm that computes optical flow estimates at each image
point using the 5 pairs (1&2, 2&3, 3&4, 4&5, 5&6) of consecutive images.
Optical flow estimates can be computed using the optical flow constraint equation
and Lucas-Kanade solution presented in class. For smooth motions, this algorithm
should produce robust flow estimates. However, given that the six images were
taken with fairly large time intervals in between consecutive images, the brightness
and temporal derivatives used by the algorithm are expected to be unreliable.
Therefore, you are advised to implement a different (and simpler) optical flow
algorithm. Given two consecutive images (say 1 and 2), establish correspondences
between points in the two images using template matching. For each image point in
the first image, take a small window (say 7x7) around the point and use it as the
template to find the same point in the second image. While searching for the
corresponding point in the second image, you can confine the search to a small
window around the pixel in the second image that has the same coordinates as the
one in the first image. The center of the 7x7 image window in the second image that
is maximally correlated with the 7x7 window in the first image is assumed to be the
corresponding point. The vector between two corresponding points is the optical
flow (u,v).
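The matching step described above can be sketched for a single image point as follows. This is a minimal illustration, not the required implementation: the names ncc and match_point and the parameter search_radius are hypothetical, normalized cross-correlation is assumed as the correlation measure, and the caller is assumed to pass a point at least template_radius pixels away from the border of the first image.

```python
import numpy as np

def ncc(a, b):
    """Normalized cross-correlation between two equally sized patches."""
    a = a.astype(np.float64) - a.mean()
    b = b.astype(np.float64) - b.mean()
    denom = np.sqrt(np.sum(a * a) * np.sum(b * b))
    if denom == 0:
        # Flat patch: correlation is undefined, so never select it.
        return -np.inf
    return float(np.sum(a * b) / denom)

def match_point(img1, img2, row, col, template_radius=3, search_radius=5):
    """Return the flow (u, v) at (row, col); u is horizontal, v is vertical."""
    t = template_radius
    template = img1[row - t:row + t + 1, col - t:col + t + 1]
    best_score, best_uv = -np.inf, (0, 0)
    for dr in range(-search_radius, search_radius + 1):
        for dc in range(-search_radius, search_radius + 1):
            r, c = row + dr, col + dc
            # Skip candidates whose window would fall outside img2.
            if (r - t < 0 or c - t < 0 or
                    r + t + 1 > img2.shape[0] or c + t + 1 > img2.shape[1]):
                continue
            score = ncc(template, img2[r - t:r + t + 1, c - t:c + t + 1])
            if score > best_score:
                best_score, best_uv = score, (dc, dr)
    return best_uv
```

With template_radius=3 the template is the 7x7 window mentioned above. If every candidate window is flat, the sketch falls back to zero flow.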
Write a program computeFlow that computes optical flow between two gray-level
images, and produces the optical flow vector field as a “needle map” of a given
resolution, overlaid on the first of the two images.
result = computeFlow(img1, img2, win_radius, template_radius, grid_MN)
You need to choose a value for the grid spacing that gives good results without
taking excessively long to compute. (6 points)
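One possible way to lay out the needle-map grid is sketched below. It assumes that grid_MN holds the number of grid rows and columns (M, N), which the handout does not state explicitly. The helper name grid_points and its margin parameter are hypothetical. The margin keeps every template and search window inside the image.

```python
import numpy as np

def grid_points(height, width, grid_mn, margin):
    """Row and column coordinates of an M x N grid kept `margin` pixels from the border."""
    m, n = grid_mn
    rows = np.linspace(margin, height - 1 - margin, m).round().astype(int)
    cols = np.linspace(margin, width - 1 - margin, n).round().astype(int)
    return rows, cols

# Example: a 240 x 320 image, 12 x 16 needles, and a margin equal to
# template radius (3) plus search radius (5).
rows, cols = grid_points(240, 320, (12, 16), margin=3 + 5)
```

A coarser grid means fewer template searches and a faster run, at the cost of a sparser needle map.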
For debugging purposes, use the test case in debug1a. In this synthetic case, the flow
field consists of horizontal vectors of the same magnitude (translational motion
parallel to the image plane). Note that in the real case, foreshortening effects,
occlusions, and reflectance variations (as well as noise) complicate the result.
(2 points)
Challenge 2: Your task is to develop a vision system that tracks the location of an
object across video frames. Object tracking is a challenging problem since an
object’s appearance, pose and scale tend to change as time progresses. In class we
have discussed three popular tracking methods: template-based tracking,
histogram-based tracking and detection-based tracking. In this challenge, we will
assume the color distribution of an object stays relatively constant over time.
Therefore, we will track an object using its color histogram.
A color histogram describes the color distribution of a color image. The color
histogram that you will need to compute is defined as follows. Each bin of the color
histogram represents a range of colors, and the number of votes in each bin
indicates the number of pixels that have the colors within the corresponding color
range.
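To make the definition concrete, here is a minimal sketch of a color histogram. It assumes an 8-bit RGB image and a fixed, uniform quantization of each channel into bins_per_channel levels. That quantization is a simplification chosen for illustration and is not the color map the assignment asks you to generate. The name color_histogram is hypothetical.

```python
import numpy as np

def color_histogram(patch, bins_per_channel=8):
    """Count pixels per quantized RGB color; patch is H x W x 3, uint8."""
    k = bins_per_channel
    # Per-channel bin index in [0, k).
    q = (patch.astype(np.int64) * k) // 256
    # Combine the three channel indices into one bin index in [0, k**3).
    idx = (q[..., 0] * k + q[..., 1]) * k + q[..., 2]
    return np.bincount(idx.ravel(), minlength=k ** 3)
```

Each entry of the returned array is the number of votes in one bin, and the entries sum to the number of pixels in the patch.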
Be careful: when initializing your program, you should generate a color map
from the region of interest (ROI), and compute all subsequent color histograms
based on the same color map. It is only meaningful to compare two histograms
computed based on the same color map. Use the provided function chooseTarget
to drag a rectangle around a tracking target.
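The sketch below illustrates why a shared color map matters. Both histograms are computed over the same K x 3 color map, so their bins line up and can be compared. How the color map is generated from the ROI is left open here, and it is simply passed in. The names histogram_with_map and intersection are hypothetical. Histogram intersection is assumed as one common similarity measure, not necessarily the one intended by the assignment.

```python
import numpy as np

def histogram_with_map(patch, color_map):
    """Histogram of an H x W x 3 patch over the rows of a K x 3 color map."""
    pixels = patch.reshape(-1, 3).astype(np.float64)
    cmap = np.asarray(color_map, dtype=np.float64)
    # Squared distance from every pixel to every map color: (num_pixels, K).
    d2 = ((pixels[:, None, :] - cmap[None, :, :]) ** 2).sum(axis=2)
    # Each pixel votes for its nearest map color.
    nearest = d2.argmin(axis=1)
    return np.bincount(nearest, minlength=len(cmap))

def intersection(h1, h2):
    """Histogram intersection after normalization (1.0 means identical)."""
    p = h1 / h1.sum()
    q = h2 / h2.sum()
    return float(np.minimum(p, q).sum())
```

The distance array holds num_pixels x K x 3 values, which is fine for a small ROI but may need chunking for large regions.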
Write a program named trackingTester that estimates the location of an object in
video frames.
trackingTester(data_params, tracking_params)
trackingTester should draw a box around the target in each video frame, and
save all the annotated video frames as PNGs into a subfolder given in
data_params.out_dir.
After generating the annotated video frames, use the provided function
generateVideo to create a video file containing all the frames.
(12 points)
Include all the code you have written, as well as the resulting video files, but
DO NOT include the three tracking datasets and the individual output frames
in your submission.