DPM v5源码阅读(未完成)

TTransposition在他博客里面提到了,v3的版本和论文最接近,v4增加了模型数由2个增加为6个,v5提取了语义特征。
对比v3和v5的代码,发现代码的风格和结构都有不少的变化。
Here we go~

Demo.m

function demo()
startup;

fprintf('compiling the code...');
compile;
fprintf('done.\n\n');

load('VOC2010/car_final');
model.vis = @() visualizemodel(model, ...
                  1:2:length(model.rules{model.start}));
test('000034.jpg', model, -0.3);

load('INRIA/inriaperson_final');
model.vis = @() visualizemodel(model, ...
                  1:2:length(model.rules{model.start}));
test('000061.jpg', model, -0.3);

load('VOC2010/person_grammar_final');
model.class = 'person grammar';
model.vis = @() visualize_person_grammar_model(model, 6);
test('000061.jpg', model, -0.6);

load('VOC2010/bicycle_final');
model.vis = @() visualizemodel(model, ...
                  1:2:length(model.rules{model.start}));
test('000084.jpg', model, -0.3);

function test(imname, model, thresh)
cls = model.class;
fprintf('///// Running demo for %s /////\n\n', cls);

% load and display image
im = imread(imname);
clf;
image(im);
axis equal; 
axis on;
disp('input image');
disp('press any key to continue'); pause;
disp('continuing...');

% load and display model
model.vis();
disp([cls ' model visualization']);
disp('press any key to continue'); pause;
disp('continuing...');

% detect objects
[ds, bs] = imgdetect(im, model, thresh);
top = nms(ds, 0.5);
clf;
if model.type == model_types.Grammar
  bs = [ds(:,1:4) bs];
end
showboxes(im, reduceboxes(model, bs(top,:)));
disp('detections');
disp('press any key to continue'); pause;
disp('continuing...');

if model.type == model_types.MixStar
  % get bounding boxes
  bbox = bboxpred_get(model.bboxpred, ds, reduceboxes(model, bs));
  bbox = clipboxes(im, bbox);
  top = nms(bbox, 0.5);
  clf;
  showboxes(im, bbox(top,:));
  disp('bounding boxes');
  disp('press any key to continue'); pause;
end

fprintf('\n');

参考链接

1、 DPM(Defomable Parts Model) 源码分析-检测(二)
2、 DPM(Defomable Parts Model) 源码分析-训练(三)

猜你喜欢

转载自blog.csdn.net/yomo127/article/details/51122648