EEE-6512EEL-4930 Image Processing and Computer Vision Spring 2024 Homework #7 (Optional)
March 21, 2024
Due: April 22, 2024, 11:59 PM
This assignment should be completed individually by the student. Proper citation should be provided for any references used.
Please read the requirements carefully. Solutions that do not follow the provided specifications will not receive credit. You are free to use any built-in/toolbox functions within MATLAB to accomplish this task, except functions from the deep learning toolbox. Data and background were taken from the Sign Language MNIST Kaggle dataset page [1].
Background: The original MNIST image dataset of handwritten digits is a popular benchmark for image-based artificial intelligence methods, but researchers have renewed efforts to update it and develop drop-in replacements that are more challenging for computer vision and original for real-world applications. American Sign Language (ASL) MNIST is one such dataset, consisting of images of hand gestures that represent a multi-class problem with 24 classes of letters (excluding J and Z, which require motion). This has applications in live ASL/spoken word translation. See provided asl_reference.png for the ASL letters. For more information, please see [1].
Data: The dataset format is patterned to match closely with the classic MNIST. Data is stored in CSV format with:
1. a label (0-25) as a one-to-one map for each alphabetic letter A (and no cases for 9=J or 25=Z because of gesture motions)
2. pixel1, pixel2, .. pixel784 which represent a single image. Data was preprocessed by cropping to hands-only, gray-scaling to uint8 bit depth, and resizing to a 28x28 pixel image.
3. For this assignment, we are only concerned with the letters A-D, inclusive. Please use the provided asl_mnist.csv.
Challenge: Write a function, myASLTranslate, which:
• accepts a single 28x28 uint8 greyscale image and returns a single character, either “A”,
“B”, “C”, or “D”.
• You must:
o Use at least one filter on the grayscale image
o Use at least one morphological image processing operation
o Use at least one region feature from section Chapter 12
o Include in your report the accuracy of your function on all data in asl_mnist.csv
• Note: Your function will be tested on 25 randomly sampled images taken from the provided asl_mnist.csv. Code must achieve at least 80% accuracy on the sampled dataset to receive credit.
To receive full credit, you should submit two files. 1.) A document containing an explanation of how your code works, (.DOC, .DOCX, or PDF file) 2.) An M-file containing commented MATLAB code for the program myASLTranslate. Students should ensure that their M-files execute without errors to avoid receiving point deductions.
References
请加QQ:99515681 邮箱:99515681@qq.com WX:codinghelp
- Excel服务器2025实现了不用安装Excel也能实现Excel共享
- 无界智造 场域共生丨荣事达智能房屋闪耀亮相2025世界制造业大会
- 连连数字CEO辛洁受邀出席INVESTOPIA全球系列对话·中国论坛 与业内共探中阿投资合作机遇
- 共话AI赋能数字化转型 重构企业智能管理新生态
- 三星官宣5月13日举行新品发布会,超轻薄Galaxy S25 Edge发布
- HGC环电强化国际业务领导架构 谭君骥及Ravindran Mahalingam分别担任专精职务
- 海伯森六维力传感器:助力人形机器人产业发展的创新力量
- 达闼董事长黄晓庆:以技术破局致胜从未止步
- 从辅助到核心,企业如何基于AI Agent升级品牌数字营销
- 国产2.5亿超高分辨率图像传感器发布,主要面向机器视觉领域
- 西部数据推出多款超高速、大容量存储解决方案
- 中关村e谷承办“科创耀未来 奋进谱新篇”企业家创新论坛圆满落幕
- 航科卫星“汕头数字一号”卫星发射成功!
- Gartner 最新魔力象限出炉!ManageEngine卓豪成功入围
- 科技重塑物流,英特尔&集和诚加速智慧物流发展!