1 /** 2 ?????* 提取HTML标签的属性值 3 ?????* @param source HTML标签内容 ?4 ?????* ?????"<a title=中国体育报 href=‘‘>aaa</a><a title=‘北京日报‘ href=‘‘>bbb</a>" 5 ?????* @param element 标签名称 ???a 6 ?????* @param attr 标签属性 ??title 7 ?????* @return 8 ?????*/ 9 ????public static List<String> match(String source, String element, String attr) {10 ????????List<String> result = new ArrayList<String>();11 ????????String reg = "<" + element + "[^<>]*?\\s" + attr + "=[‘\"]?(.*?)[‘\"]?\\s.*?>";12 ????????Pattern pt = Pattern.compile(reg);13 ????????Matcher m = pt.matcher(source);14 ????????while (m.find()) {15 ????????????String r = m.group(1);16 ????????????result.add(r);17 ????????}18 ????????return result;19 ????}
正则表达式,提取html标签的属性值
原文地址:https://www.cnblogs.com/guoxiangyue/p/10318827.html