I have a string like the following:
$string = "
<paragraph>apples are red...</paragraph>
<paragraph>john is a boy..</paragraph>
<paragraph>this is dummy text......</paragraph>
";
I want to split this string into an array containing the text found between the <paragraph></paragraph>
tags. For example, something like this:
$string = "
<paragraph>apples are red...</paragraph>
<paragraph>john is a boy..</paragraph>
<paragraph>this is dummy text......</paragraph>
";
$paragraphs = splitParagraphs($string);
/* $paragraphs now contains:
$paragraphs[0] = apples are red...
$paragraphs[1] = john is a boy..
$paragraphs[2] = this is dummy text......
*/
Any ideas?
Also, it should be case-insensitive: <paragraph>, <PARAGRAPH>, and <Paragraph>
should all be treated as the same.
Edit: This is not XML. There is a lot in here that would break the XML structure, so I can't use tools such as SimpleXML. I need a regular expression to parse it.
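
To make the requirement concrete, here is a minimal sketch of what splitParagraphs might look like using preg_match_all. It assumes the tags carry no attributes and are never nested, which the examples above suggest but do not guarantee. The function name comes from the call shown earlier; the pattern and the mixed-case sample input are only illustrative.

```php
<?php
// Hypothetical helper, named after the call in the example above.
// Assumes <paragraph> tags have no attributes and are not nested.
function splitParagraphs(string $string): array
{
    // i: match tag names case-insensitively.
    // s: let "." match newlines, so a paragraph may span several lines.
    // (.*?) is lazy, so each match ends at the nearest closing tag.
    preg_match_all('#<paragraph>(.*?)</paragraph>#is', $string, $matches);

    // $matches[1] holds the captured text between each pair of tags.
    return $matches[1];
}

$string = "
<paragraph>apples are red...</paragraph>
<PARAGRAPH>john is a boy..</PARAGRAPH>
<Paragraph>this is dummy text......</Paragraph>
";
$paragraphs = splitParagraphs($string);
print_r($paragraphs);
```

If the tags could carry attributes, the opening part of the pattern would probably need to become something like <paragraph\b[^>]*>, but that goes beyond what the examples show.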