With regex, how do I match between an XML tag multiple times?

First, before you say anything: I HAVE to do this because the RSS is malformed and I can't correct it on my end. I tried using an RSS parser and an XML parser, but they fail, and I only have front-end access. I'm super close, but I can't figure out why this won't match.

The feed (it's one long single-line string): http://pastebin.com/5dJhXCvf

First Example:

<title>(.+)</title>

I thought this worked great with my test of just:

<title>&quot;cterrorism task force&quot; location:oregon - Google News</title>

But the issue is that it then matches everything as one match. For example:

<title>&quot;cterrorism task force&quot; location:oregon - Google News</title><title>&quot;cterrorism task force&quot; location:oregon - Google News</title>

That gives one result item in my array from both exec() and match().

So I tried:

<title>([\w\d\s\=\%\_\`\~\+\!\@\#\$\%\^\&\*\(\)\:\'\"\[\]\{\}\|\,\.\/]+)</title>

But that returns nothing... Any ideas?

Answers


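Try the non-greedy version: <title>(.+?)<\/title>. The greedy .+ consumes as much as it can, so it runs through to the last </title> in the string and everything ends up in one match; .+? stops at the first closing tag instead. You can test these things in an online regex tester.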
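
A minimal JavaScript sketch of the difference, assuming the feed is held in a string. The variable names (feed, greedy, lazyPattern, titles) are made up for illustration, and the sample input is just the two-title string from the question, not the full pastebin feed. It also assumes the g flag is set, which exec() needs in order to advance to the next match on each call.

```javascript
// Sample input: two <title> elements on a single line, as in the question.
const feed =
  '<title>&quot;cterrorism task force&quot; location:oregon - Google News</title>' +
  '<title>&quot;cterrorism task force&quot; location:oregon - Google News</title>';

// Greedy: .+ runs through to the last </title>, so the whole string is one match.
const greedy = feed.match(/<title>(.+)<\/title>/g);

// Lazy: .+? stops at the first </title>; the g flag lets exec() continue from there.
const lazyPattern = /<title>(.+?)<\/title>/g;
const titles = [];
let m;
while ((m = lazyPattern.exec(feed)) !== null) {
  titles.push(m[1]); // capture group 1 is the text between the tags
}

console.log(greedy.length); // number of greedy matches
console.log(titles.length); // number of lazy matches
console.log(titles);
```

As for the character-class attempt, it most likely returns nothing on this sample because the class contains neither ; nor -, and the title contains both (&quot; and " - Google News").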

