前面我们介绍了xml文件,今天我们试着用boost库来解析xml文件。我们将举两个例子来说明怎么使用。
来自boost官方的例子
先看xml文件的内容:
<debug><filename>debug.log</filename><modules><module>Finance</module><module>Admin</module><module>HR</module></modules><level>2</level>
</debug>
我们再来看如何使用boost读取和保存xml文件。
// ----------------------------------------------------------------------------
// Copyright (C) 2002-2006 Marcin Kalicinski
//
// Distributed under the Boost Software License, Version 1.0.
// (See accompanying file LICENSE_1_0.txt or copy at
// http://www.boost.org/LICENSE_1_0.txt)
//
// For more information, see www.boost.org
// ----------------------------------------------------------------------------#include <boost/property_tree/ptree.hpp>
#include <boost/property_tree/xml_parser.hpp>
#include <boost/foreach.hpp>
#include <string>
#include <set>
#include <exception>
#include <iostream>struct debug_settings
{std::string m_file; // log filenameint m_level; // debug levelstd::set<std::string> m_modules; // modules where logging is enabledvoid load(const std::string &filename);void save(const std::string &filename);
};void debug_settings::load(const std::string &filename)
{// Create empty property tree objectusing boost::property_tree::ptree;ptree pt;// Load XML file and put its contents in property tree. // No namespace qualification is needed, because of Koenig // lookup on the second argument. If reading fails, exception// is thrown.read_xml(filename, pt);// Get filename and store it in m_file variable. Note that // we specify a path to the value using notation where keys // are separated with dots (different separator may be used // if keys themselves contain dots). If debug.filename key is // not found, exception is thrown.m_file = pt.get<std::string>("debug.filename");// Get debug level and store it in m_level variable. This is // another version of get method: if debug.level key is not // found, it will return default value (specified by second // parameter) instead of throwing. Type of the value extracted // is determined by type of second parameter, so we can simply // write get(...) instead of get<int>(...).m_level = pt.get("debug.level", 0);// Iterate over debug.modules section and store all found // modules in m_modules set. get_child() function returns a // reference to child at specified path; if there is no such // child, it throws. Property tree iterator can be used in // the same way as standard container iterator. Category // is bidirectional_iterator.//BOOST_FOREACH(ptree::value_type &v, pt.get_child("debug.modules"))// m_modules.insert(v.second.data());}void debug_settings::save(const std::string &filename)
{// Create empty property tree objectusing boost::property_tree::ptree;ptree pt;// Put log filename in property treept.put("debug.filename", m_file);// Put debug level in property treept.put("debug.level", m_level);// Iterate over modules in set and put them in property// tree. Note that the add function places new key at the// end of list of keys. This is fine in most of the// situations. If you want to place item at some other// place (i.e. at front or somewhere in the middle),// this can be achieved using a combination of the insert// and put_value functionsBOOST_FOREACH(const std::string &name, m_modules)pt.add("debug.modules.module", name);// Write property tree to XML filewrite_xml(filename, pt); //write_xml(cout,pt); //这个函数有重载. 可以用流 也可直接用文件名. }int main()
{try{debug_settings ds;ds.load("debug_settings.xml");ds.save("debug_settings_out.xml");std::cout << "Success\n";}catch (std::exception &e){std::cout << "Error: " << e.what() << "\n";}return 0;
}
解析:
load函数:
首先定义了解析树
using boost::property_tree::ptree;ptree pt;
然后读取xml文件
接下来三行代码,读取文件里的内容。
我们注意到:
上面的xml的根节点是debug。然后有三个节点:filename,modules,level。
其中modules是一个含有子节点的复合节点。
于是:
1.
m_file = pt.get<std::string>("debug.filename");
读取filename。如读取失败,则抛出异常。
2.
m_level = pt.get("debug.level", 0);
获取level数,当然了我们也可以通过和前一句一样的语法获取m_level:
m_level = pt.get<int>("debug.level");
但是同样这句话一旦获取不到,就会抛出异常,如果我们想获取不到,返回一个默认值0呢?此时可以使用
m_level = pt.get("debug.level", 0);
来实现。其中最后返回值的类型通过默认值来推断,非常类似c++11的auto语法。
3.
BOOST_FOREACH(ptree::value_type &v, pt.get_child("debug.modules"))m_modules.insert(v.second.data());
由于modules是一个复合节点,我们可以通过循环遍历的方法访问节点的子节点。
BOOST_FOREACH类似c++11的for(auto& value: range)
循环遍历的第一句就是:<module>Finance</module>
,而v.first==module,v.second==Finance,但是我们要通过data()来获取。
我们可以通过改变上述语句为下面语句验证我的推断:
BOOST_FOREACH(ptree::value_type &v, pt.get_child("debug.modules")){std::cout << v.first<< " "<<v.second.data()<<std::endl;m_modules.insert(v.second.data());}
值得注意的是我测试的时候发现获取first加不加.data()都可以,但获取second必须加.data().
save函数
实际上是read的翻译版,只需将get换成put即可.我们只要按照变量对应的标签加即可。
另一个更复杂的例子
xml文件如下:
<debug name="debugname"><file name="debug.log"/><modules type="internal"><module1>Finance_Internal</module1><module2>Admin_Internal</module2><module3>HR_Internal</module3></modules><modules type="external"><module>Finance_External</module><module>Admin_External</module><module>HR_External</module> </modules>
</debug>
分析以上xml文件,我们会发现此刻带有了属性,还有深层嵌套。分析起来,稍复杂一些。前面我们讲过xml文件中属性其实可以看成子元素的形式。因此我们对debug遍历的时候,第一句应该是name="debugname"
,第二句是<file name="debug.log"/>
第三句是:
<modules type="internal"><module1>Finance_Internal</module1><module2>Admin_Internal</module2><module3>HR_Internal</module3></modules>
第四句是: <modules type="external"><module>Finance_External</module><module>Admin_External</module><module>HR_External</module> </modules>
然后我们看代码:
#include <iostream>
#include <string>
#include <boost/property_tree/ptree.hpp>
#include <boost/property_tree/xml_parser.hpp>
#include <boost/foreach.hpp>using namespace std;
using namespace boost::property_tree;int main(void){ptree pt;read_xml("debug_settings2.xml", pt);//loop for every node under debugBOOST_FOREACH(ptree::value_type &v1, pt.get_child("debug")){if (v1.first == "<xmlattr>"){ //it's an attribute//read debug name="debugname"cout << "debug name=" << v1.second.get<string>("name") << endl;}else if (v1.first == "file"){//read file name="debug.log"cout << " file name=" << v1.second.get<string>("<xmlattr>.name") << endl;}else{ // v1.first == "modules"//get module typecout << " module type:" << v1.second.get<string>("<xmlattr>.type") << endl;//loop for every node under modulesBOOST_FOREACH(ptree::value_type &v2, v1.second){if (v2.first == "<xmlattr>"){ //it's an attribute//this can also get module typecout << " module type again:" << v2.second.get<string>("type") << endl;}else{//all the modules have the same structure, so just use data() function.cout << " module name:" << v2.second.data() << endl;}}//end BOOST_FOREACH}}//end BOOST_FOREACH
}
注意:
对于属性来说,first指”<xmlattr>
“,而不是“name”,v.second指的是name的具体值.
参考文献:
1.使用Boost property tree来解析带attribute的xml
2.http://www.boost.org/doc/libs/1_46_1/doc/html/boost_propertytree/tutorial.html