﻿ 计算机网络代写 - CS/RBE 549 Computer Vision|学霸联盟

# 一站式論文代寫,英国、美国、澳洲留学生Essay代寫—FreePass代写

1. Object Representation (8 pts): We can represent an object by its boundary (??(??), ??(??)), 0 ≤ ?? ≤ ?? where S is the length of the object’s boundary and s is distance along that boundary from some arbitrary starting point. We can combine x and y into a single complex function ??(??) = ??(??) + ????(??). The Discrete Fourier Transform (DFT) of z is ??(??) = ∑???2?????????? ???1 ??=0 ??(??), 0 ≤ ?? ≤ ?? ? 1 We can use the coefficients ??(??) to represent the object boundary. The limit on s is S-1 because for a closed contour ??(??) = ??(0). The Inverse Discrete Fourier Transform is ??(??) = 1??∑??+2???????? ?? ???1 ??=0 ??(??), 0 ≤ ?? ≤ ?? ? 1 a. Suppose that the object is translated by (???, ???), that is, ??′(??) = ??(??) + ??? + ?????. How is ??′’s DFT ??′(??) related to ??(??)? b. What object has ??(??) = ?? cos 2???? ?? + ???? sin 4???? ?? ? Sketch it. c. What is ??(??) corresponding to ??(??) from Part b? Hint: Most coefficients are 0. 2. Interpretation Tree (5 pts): We have a choice of matching detected image elements (edges) to the model or model elements to the object. Let E be the set of detected image edges and M the set of model edges. In the first case, matching image edge to model edges, we generate a tree of depth |??| and breadth |??| with tree size |??||??| . In the case of matching the model to image, we generate a tree of size |??||??| . We expect many more image elements than model elements – there may be many candidate image edges in a cluttered scene vs. a small number of model edges. a. Which approach is preferable, matching image edges to model or model to image edges? You might consider the case where there are 12 image edges and 5 model edges, for example. b. One advantage to using the interpretation tree approach is that it is possible to match an unknown object in the image to a model even if the object is partially occluded. We do this by allowing an object element to match a “null element” in the model. Does this change your answer to part a.? How and why? Or why not? 3. Stereo via Singular Value Decomposition (8 pts): Assume the usual stereo geometry, where the left and right cameras are offset by baseline ??? that is perpendicular to the common focal vector ?? . Then the stereo imaging equations are ?? ?? = |?? |2 ?? ? ?? ?? (?? ?? + ??? 2) , ?? ?? = |?? |2 ?? ? ?? ?? (?? ?? ? ??? 2) In the presence of imaging errors or noise, these equations might not hold exactly. They can be approximated by ?? ?? ? |?? |2 ?? ? ?? ?? (?? ?? + ??? 2) ≈ 0? , ?? ?? ? |?? |2 ?? ? ?? ?? (?? ?? ? ??? 2) ≈ 0? a. Show that these equations can be written as a 4x4 matrix operating on a column vector in homogeneous coordinates. [??? 0 0 ??? ???? ?????/2 ???? 0 ??? 0 0 ??? ???? ????/2 ???? 0 ] [??????????1??] ≈ 0? Hint: Combine the approximate imaging equations into a single matrix equation, multiply to eliminate the denominators, and simplify, not necessarily in that order! b. The above equation can be written as ?????′ ≈ 0? . We can use SVD to find the singular vector ???′ that minimizes |???? |2 subject to |?? |2 = 1. Express world point ?? ?? = [??, ??, ??]?? in terms of ???′ = [??′, ??′, ??′, ??′]??. c. When ???? = ????, show that a. gives ???? = ???? ?? , where ?? is the disparity. 4. Binary Image Matching (2 pts): Let ??1 and ??2 be binary images. Show that |??1 ? ??2|2 = ∑# of pixels where ??1 ≠ ??2 Where |??|2 = ∑ ?????? 2 is the sum of all (pixels squared) in I.

Essay_Cheery