Problem Set 4
CS 6347
Due: 4/25/2024 by 11:59pm
Note: all answers should be accompanied by explanations for full credit. Late homeworks
cannot be accepted. All submitted code MUST compile/run.
Problem 1: Expectation Maximization for Colorings (40 pts)
For this problem, we will use the same factorization as we have in past assignments. As on the
previous assignment, the weights will now be considered parameters of the model that need to be
learned from samples.
Suppose that some of the vertices, L ⊆ V, are latent variables in the model. Given m samples
of the observed variables in V \ L, what is the log-likelihood as a function of the weights? Perform
MLE using the EM algorithm. Your solution should be written as a MATLAB function that takes
as input an n × n matrix A corresponding to the adjacency matrix of a graph G, an n-dimensional
binary vector L whose non-zero entries correspond to the latent variables, and samples, which is an
n × m k-ary matrix where samples_{i,t} corresponds to the observed color for vertex i in the t-th
sample (you should discard any inputs related to the latent variables). The output should be the vector of
weights w corresponding to the MLE parameters for each color from the EM algorithm. Note that
you should use belief propagation to approximate the counting problem in the E-step.
function w = colorem(A, L, samples)
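To make the expected input shapes concrete, here is a minimal sketch of a toy call. The graph, the latent set, and the sample values are made up for illustration, and two conventions are assumptions rather than part of the problem statement: colors are coded as integers 1..k, and A is symmetric with a zero diagonal (an undirected graph without self-loops).

```matlab
% Hypothetical toy inputs for colorem; all values are illustrative only.
A = [0 1 1; 1 0 1; 1 1 0];      % adjacency matrix of a triangle (n = 3)
L = [0; 0; 1];                  % vertex 3 is latent
samples = [1 2 3 1;             % n-by-m matrix, here m = 4 samples
           2 3 1 3;             % assumption: colors are coded 1..k with k = 3
           3 1 2 2];            % row 3 belongs to a latent vertex

% Shape checks implied by the problem statement.
% Assumption: G is undirected with no self-loops.
n = size(A, 1);
assert(isequal(A, A') && all(diag(A) == 0));
assert(numel(L) == n && size(samples, 1) == n);

% Rows of samples for latent vertices are discarded before learning.
observed = samples(L == 0, :);  % 2-by-4 matrix of observed colors

% w = colorem(A, L, samples);   % to be implemented; one weight per color
```

The last line is commented out because colorem is the function to be written; the sketch only fixes the layout of its arguments.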
Problem 2: EM for Bayesian Networks (60 pts)
For this problem, you will use the house-votes-84.data data set provided with this problem set.
Each row of the provided data file corresponds to a single observation of a voting record for a
congressperson: the first entry is party affiliation and the remaining entries correspond to votes on
different legislation with question marks denoting missing data.
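One possible way to turn such records into a numeric matrix is sketched below. It assumes the file is comma-separated, that party is written as republican or democrat and is never missing, and that votes are written as y or n. The two records are made up for illustration, and the 0/1 coding with NaN for missing entries is just one convenient choice.

```matlab
% Two made-up records in the assumed file format (party first, then votes).
% For the real file, read its lines instead, e.g. with fgetl in a loop.
records = {'republican,n,y,?', 'democrat,?,y,y'};

numObs = numel(records);
fields = strsplit(records{1}, ',');
data = nan(numObs, numel(fields));       % NaN marks a missing entry
for r = 1:numObs
    fields = strsplit(records{r}, ',');
    % Assumption: the party label is never missing.
    data(r, 1) = double(strcmp(fields{1}, 'republican'));   % 1 or 0
    for c = 2:numel(fields)
        if strcmp(fields{c}, 'y')
            data(r, c) = 1;
        elseif strcmp(fields{c}, 'n')
            data(r, c) = 0;
        end                               % '?' is left as NaN
    end
end
disp(data)
```

Keeping missing votes as NaN lets the E-step find them with isnan, rather than dropping incomplete rows.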
1. Using the first three features and the first 300 data observations only, fit a Bayesian network
to this data using the EM algorithm for each of the six possible complete DAGs over three
variables (one for each ordering of the variables).
2. Do different runs of the EM algorithm produce different models?
3. Evaluate your six models, on the data that was not used for training, for the task of
predicting party affiliation given the values of the other two features. Is the prediction highly