BoW(词袋模型)+python代码实现

<div id="article_content" class="article_content clearfix csdn-tracking-statistics" data-pid="blog" data-mod="popu_307" data-dsm="post">
                    <link rel="stylesheet" href="https://csdnimg.cn/release/phoenix/template/css/ck_htmledit_views-e2445db1a8.css">
						<div class="htmledit_views">
                
<div id="article_content" class="article_content" style="line-height:26px;color:rgb(85,85,85);">
<h1><span style="line-height:24px;text-indent:28px;"><span style="font-family:'KaiTi_GB2312';font-size:32px;">BoW(词袋模型)+python代码实现</span></span></h1>
<div><span style="line-height:24px;text-indent:28px;"><span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><br></span></span></span></div>
</div>
<div class="article_content"><span style="font-family:'KaiTi_GB2312';font-size:24px;color:#666666;"><span style="line-height:24px;">文章是参考整理得来,大家可以去文章最后的参考文献中去查看原文!</span></span></div>
<div class="article_content" style="color:rgb(85,85,85);line-height:26px;">
<span style="font-family:'KaiTi_GB2312';"><span style="color:rgb(102,102,102);line-height:24px;"><span style="font-size:24px;">文档主要分以下几部分内容:</span></span><span style="color:rgb(102,102,102);line-height:24px;"><span style="font-size:24px;"><br><br>
&nbsp; &nbsp; &nbsp; 1.SIFT<br>
&nbsp; &nbsp; &nbsp; 2.BOW<br>
&nbsp; &nbsp; &nbsp; 3.VLAD(未包含,请直接在下面的资源链接中获取)</span><br><br></span></span></div>
<div class="article_content" style="line-height:26px;">
<span style="line-height:24px;"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:#666666;">实验需要安装 Python 2.7,推荐安装</span><span style="color:#ff0000;"><strong>anaconda</strong></span><span style="color:#666666;"> (python的科学计算发行版,包含很多的常用的库,不需要自己在一一去下载安装,很实用)。</span></span></span></div>
<div class="article_content" style="line-height:26px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;color:rgb(85,85,85);"><span style="color:rgb(102,102,102);line-height:24px;"></span></span>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><strong><span style="color:#ff0000;">实验过程</span></strong></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><strong><br></strong></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><strong>1.下载数据库:</strong></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">ukbench数据库链接<a target="_blank">http://vis.uky.edu/~stewe/ukbench/</a></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(85,85,85);font-family:'KaiTi_GB2312';font-size:18px;line-height:26px;">数据库含有2550x4张图片,每4张为一组,总共2550组,每张图片的尺寸为640x480。</span></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">下载之后,把图片放在&nbsp;</span><strong><span style="font-size:18px;"><span style="color:#FF0000;">X:</span><span style="color:#FF0000;">\bow\dataset\train</span></span><span style="color:#FF0000;">&nbsp;&nbsp;</span></strong><span style="font-family:'KaiTi_GB2312';font-size:18px;">文件夹下面。(X是自己的盘符,下面语句
 X 是同样的含义)</span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><br></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><strong>2.提取特征:</strong></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);">window平台:</span></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);">在</span><span style="color:rgb(34,34,34);">CMD</span><span style="color:rgb(34,34,34);">命令窗口输入一下命令,进=进入数据库和.py文件所在目录,运行蓝色命令行。</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">X:</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">cd bow</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">dir</span></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:#3333ff;">python findFeatures.py -t dataset/train/</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">结果会保存为bof.pkl文件。</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">说明:</span></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);">&nbsp;这一步提取特征很慢,而且需要很强悍的电脑才行。。。。你可以只用数据库中的前</span><span style="color:rgb(34,34,34);">500</span><span style="color:rgb(34,34,34);">张图片(其他的删掉)来测试玩玩就行了,跑整个数据库的话,普通电脑肯定是跑不动的。。。。如果电脑太渣,直接使用我下面上传的bof.pkl,里面提取的是数据库前50张图片特征,你可以直接进行第三步测试查询,仍然需要用CMD进入到bow文件夹下,再使用下面的查询代码语句。</span></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);"><br></span></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"><strong>3. 查询:</strong></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);">在</span><span style="color:rgb(34,34,34);">CMD</span><span style="color:rgb(34,34,34);">命令窗口输入</span></span></p>
<p style="color:rgb(85,85,85);"><br><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><a style="font-family:'KaiTi_GB2312';font-size:18px;line-height:26px;" target="_blank">python search.py -</a><a style="font-family:'KaiTi_GB2312';font-size:18px;line-height:26px;" target="_blank">i</a><a style="font-family:'KaiTi_GB2312';font-size:18px;line-height:26px;" target="_blank">&nbsp;dataset/train/ukbenchxxxxx.jpg</a><br><a target="_blank"></a></span></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><span style="color:rgb(34,34,34);">ukbenchxxxxx.jpg</span><span style="color:rgb(34,34,34);">是文件名,自己更改为想要查询的图片名就可以了,比如</span></span></p>
<p style="color:rgb(85,85,85);"><span style="color:rgb(34,34,34);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><a target="_blank">python search.py -</a><a target="_blank">i</a><a target="_blank">
 dataset/train/ukbench00001.jpg</a></span></span></p>
<p><span style="font-family:SimSun;font-size:18px;color:#222222;">下面是效果图:</span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><img src="https://img-blog.csdn.net/20150723093638136?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><br></span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;">注:最后倒数第2个链接为原作者文章和代码链接,也可以从下面我提供的代码链接来下载使用。</span></p>
<p style="color:rgb(85,85,85);"><span style="font-family:'KaiTi_GB2312';font-size:18px;"><br></span></p>
<p><span style="font-family:'KaiTi_GB2312';font-size:18px;"></span></p>
<p style="font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><strong><span style="color:#ff0000;">文档资源获取链接:</span></strong><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';">BOW文档资源</span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-family:'KaiTi_GB2312';"><span style="font-size:18px;">1.<a href="http://download.csdn.net/detail/lilai619/8865229" rel="nofollow" style="text-decoration:none;color:rgb(12,137,207);" target="_blank"></a><a href="http://download.csdn.net/detail/lilai619/8759105" rel="nofollow" style="text-decoration:none;color:rgb(12,137,207);" target="_blank"></a><a href="http://download.csdn.net/detail/lilai619/8865417" rel="nofollow" target="_blank">http://download.csdn.net/detail/lilai619/8865417</a>(PPT)</span><br></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';">2.<a href="http://wenku.baidu.com/view/bf3f1b19daef5ef7bb0d3c53" rel="nofollow" style="text-decoration:none;color:rgb(12,137,207);" target="_blank"></a><a href="http://wenku.baidu.com/view/96501db83968011ca30091ee" rel="nofollow" style="text-decoration:none;color:rgb(12,137,207);" target="_blank"></a><a href="http://wenku.baidu.com/view/6950c49d866fb84ae45c8daa" rel="nofollow" target="_blank">http://wenku.baidu.com/view/6950c49d866fb84ae45c8daa</a>(PPT)</span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-family:'KaiTi_GB2312';"><span style="font-size:18px;">BOW代码资源</span><br></span></p>
<p style="font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><span style="color:#555555;">1.</span><a href="http://download.csdn.net/detail/lilai619/8918851" rel="nofollow" style="color:rgb(85,85,85);" target="_blank">http://download.csdn.net/detail/lilai619/8918851</a><span style="color:#555555;">(</span><span style="color:#ff0000;"><strong>Python代码</strong></span><span style="color:#555555;">)</span></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';">VLAD文档资源</span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';">1.<a href="http://download.csdn.net/detail/lilai619/8918427" rel="nofollow" target="_blank">http://download.csdn.net/detail/lilai619/8918427</a>(VLAD论文)</span></span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">2.<a href="http://download.csdn.net/detail/lilai619/8918445" rel="nofollow" target="_blank">http://download.csdn.net/detail/lilai619/8918445</a>(VLAD原作者的PPT)</span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;"><br></span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">参考文献:</span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">1.<a href="http://blog.csdn.net/abcjennifer/article/details/7639681" rel="nofollow" target="_blank">http://blog.csdn.net/abcjennifer/article/details/7639681</a>(SIFT博客)</span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">2.<a href="http://blog.csdn.net/zddblog/article/details/7521424" rel="nofollow" target="_blank">http://blog.csdn.net/zddblog/article/details/7521424</a>(SIFT博客)</span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">3.<a href="http://blog.csdn.net/willard_yuan/article/details/46519105" rel="nofollow" target="_blank">http://blog.csdn.net/willard_yuan/article/details/46519105</a>(bow+python博客)</span></p>
<p style="color:rgb(85,85,85);line-height:35px;">
<span style="font-family:'KaiTi_GB2312';font-size:18px;">4.<a href="http://blog.csdn.net/breeze5428/article/details/36441179" rel="nofollow" target="_blank">http://blog.csdn.net/breeze5428/article/details/36441179</a>(VLAD博客)</span></p>
<br><p><span style="font-family:'KaiTi_GB2312';font-size:18px;color:#ff0000;"><strong>部分文档预览:</strong></span></p>
<span style="font-family:'KaiTi_GB2312';"><span style="color:#555555;">&nbsp;<img src="https://img-blog.csdn.net/20150723093844133?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""></span><a href="http://download.csdn.net/detail/lilai619/8759105" rel="nofollow" style="text-decoration:none;color:rgb(12,137,207);" target="_blank"></a></span></div>
<div class="article_content" style="font-size:15px;color:rgb(85,85,85);line-height:26px;">
<span style="font-size:14px;color:rgb(102,102,102);line-height:24px;text-indent:28px;"><span style="font-family:'KaiTi_GB2312';"><span style="color:rgb(85,85,85);font-size:18px;line-height:35px;"></span></span></span></div>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><img src="https://img-blog.csdn.net/20150723093909584?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><img src="https://img-blog.csdn.net/20150723094141134?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><img src="https://img-blog.csdn.net/20150723094241403?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><br></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><span style="color:rgb(85,85,85);font-family:'KaiTi_GB2312';font-size:18px;line-height:35px;"><img src="https://img-blog.csdn.net/20150723094339114?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><span style="color:rgb(85,85,85);font-family:'KaiTi_GB2312';font-size:18px;line-height:35px;"><img src="https://img-blog.csdn.net/20150723094406380?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center" alt=""><br></span></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<span style="font-size:18px;"><span style="font-family:'KaiTi_GB2312';"><span style="color:rgb(85,85,85);font-family:'KaiTi_GB2312';font-size:18px;line-height:35px;"><br></span></span></span></p>
<p style="color:rgb(85,85,85);font-size:15px;line-height:35px;">
<br></p>
<p style="color:rgb(85,85,85);font-family:'microsoft yahei';font-size:15px;line-height:35px;">
<br></p>
            </div>
                </div>

猜你喜欢

转载自blog.csdn.net/weixin_37753215/article/details/81661398