<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Backlinks</title>
	<atom:link href="http://climbtothestars.org/archives/2002/07/09/backlinks/feed/" rel="self" type="application/rss+xml" />
	<link>http://climbtothestars.org/archives/2002/07/09/backlinks/</link>
	<description>More than just a blog.</description>
	<pubDate>Mon, 07 Jul 2008 02:38:03 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Stephanie</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-559</link>
		<dc:creator>Stephanie</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-559</guid>
		<description>&lt;p&gt;The backlinks script is written! Much faster than I expected. Just stuck 
with some cosmetic issues. Expect a new page in the coding section soon!&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>The backlinks script is written! Much faster than I expected. Just stuck<br />
with some cosmetic issues. Expect a new page in the coding section soon!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Amélie Poulain</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-560</link>
		<dc:creator>Amélie Poulain</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-560</guid>
		<description>&lt;p&gt;Something isn't working :-(&lt;/p&gt;

&lt;p&gt;I'll go look at the spyware and computer clean-up stuff.&lt;/p&gt;

&lt;p&gt;Thanks.&lt;/p&gt;

&lt;p&gt;Do you know &lt;a href="http://www.cexx.org/adware.htm" rel="nofollow"&gt;http://www.cexx.org/adware.htm&lt;/a&gt; and
 &lt;a href="http://www.privacy.net/analyze/" rel="nofollow"&gt;http://www.privacy.net/analyze/&lt;/a&gt;?
 Worth a look.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Something isn&#8217;t working <img src='http://climbtothestars.org/wp/wp-includes/images/smilies/icon_sad.gif' alt=':-(' class='wp-smiley' /> </p>
<p>I&#8217;ll go look at the spyware and computer clean-up stuff.</p>
<p>Thanks.</p>
<p>Do you know <a href="http://www.cexx.org/adware.htm" rel="nofollow">http://www.cexx.org/adware.htm</a> and<br />
 <a href="http://www.privacy.net/analyze/" rel="nofollow">http://www.privacy.net/analyze/</a>?<br />
 Worth a look.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stephanie</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-561</link>
		<dc:creator>Stephanie</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-561</guid>
		<description>&lt;p&gt;If you want to give me a hand, I'm stuck with this part: read the first x 
lines of a remote page and store the text in between the &amp;lt;title&amp;gt; tags in a 
variable.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>If you want to give me a hand, I&#8217;m stuck with this part: read the first x<br />
lines of a remote page and store the text in between the &lt;title&gt; tags in a<br />
variable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stephanie</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-562</link>
		<dc:creator>Stephanie</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-562</guid>
		<description>&lt;p&gt;Next step: collect Google search terms. That will be fun!&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Next step: collect Google search terms. That will be fun!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jane Doe</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-563</link>
		<dc:creator>Jane Doe</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-563</guid>
		<description>&lt;p&gt;Reading a web page with PHP is easy!
 $page=readfile("http://www.disenchanted.com/dis/technology/xanadu.html");
 will get you the source of the page.&lt;/p&gt;

&lt;p&gt;(Another solution is to use system("wget..."), which is safer 
since wget will handle retries and timeouts for you.)&lt;/p&gt;

&lt;p&gt;To parse $page,
 use a regexp or an XML parser
 to extract what you want (the title,
 and maybe the &#60;link rel="index" to catch the permalink 
 and the &#60;link rel="alternate" type="application/rss+xml" to catch the 
RSS feed, are obvious candidates).&lt;/p&gt;

&lt;p&gt;Regexps are usually the better solution, since HTML is often not valid XML.&lt;/p&gt;

&lt;p&gt;You can also try to use preg_split to find the text before and after the 
link that caused the backlink activation.&lt;/p&gt;

&lt;p&gt;The concept of &lt;em&gt;x lines&lt;/em&gt; is not well defined for HTML source...
 You can try to catch the content of the first &#60;p&gt; &#60;/p&gt;
 or the first sentence (a sentence ends with !, ? or . followed by a space).&lt;/p&gt;

&lt;p&gt;To remove all markup, use preg_replace;
 see &lt;a href="http://www.php.net/manual/en/function.preg-replace.php" rel="nofollow"&gt;http://www.php.net/manual/en/function.preg-replace.php&lt;/a&gt; for the code.&lt;/p&gt;

&lt;p&gt;Another source of help might be Biz, from Niutopia, since he has some Perl 
code that does the same thing (grabbing the title and some text about it):
 &lt;a href="http://joueb.com/niutopia/news/155.shtml" rel="nofollow"&gt;http://joueb.com/niutopia/news/155.shtml&lt;/a&gt;&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Reading a web page with PHP is easy!<br />
 $page=readfile("http://www.disenchanted.com/dis/technology/xanadu.html");<br />
 will get you the source of the page.</p>
<p>(Another solution is to use system("wget..."), which is safer<br />
since wget will handle retries and timeouts for you.)</p>
<p>To parse $page,<br />
 use a regexp or an XML parser<br />
 to extract what you want (the title,<br />
 and maybe the &lt;link rel="index" to catch the permalink<br />
 and the &lt;link rel="alternate" type="application/rss+xml" to catch the<br />
RSS feed, are obvious candidates).</p>
<p>Regexps are usually the better solution, since HTML is often not valid XML.</p>
<p>You can also try to use preg_split to find the text before and after the<br />
link that caused the backlink activation.</p>
<p>The concept of <em>x lines</em> is not well defined for HTML source&#8230;<br />
 You can try to catch the content of the first &lt;p> &lt;/p><br />
 or the first sentence (a sentence ends with !, ? or . followed by a space).</p>
<p>To remove all markup, use preg_replace;<br />
 see <a href="http://www.php.net/manual/en/function.preg-replace.php" rel="nofollow">http://www.php.net/manual/en/function.preg-replace.php</a> for the code.</p>
<p>Another source of help might be Biz, from Niutopia, since he has some Perl<br />
code that does the same thing (grabbing the title and some text about it):<br />
 <a href="http://joueb.com/niutopia/news/155.shtml" rel="nofollow">http://joueb.com/niutopia/news/155.shtml</a></p>
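<p>The markup-removal and first-sentence ideas above could look roughly like this. It is only a sketch, written in Python rather than PHP so it runs on its own; strip_markup and first_sentence are made-up names, and the "sentence ends with !, ? or . followed by a space" rule is taken at face value, so abbreviations will fool it.</p>

```python
import re


def strip_markup(html: str) -> str:
    """Remove anything that looks like a tag, then collapse whitespace.

    Assumption: tags are simply deleted (replaced by nothing), which is
    the regex equivalent of the preg_replace approach mentioned above.
    """
    text = re.sub(r"<[^>]+>", "", html)
    return " ".join(text.split())


def first_sentence(text: str) -> str:
    """Return the text up to the first '.', '!' or '?' that is followed
    by whitespace or the end of the string; the whole text if none."""
    match = re.search(r"(.+?[.!?])(?:\s|$)", text)
    return match.group(1) if match else text


sample = "<p>Hello <em>world</em>! More text follows here.</p>"
plain = strip_markup(sample)
print(plain)
print(first_sentence(plain))
```

<p>Deleting tags outright keeps punctuation attached to the preceding word, at the cost of gluing together text from adjacent block elements; replacing tags with a space would be the opposite trade-off.</p>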
]]></content:encoded>
	</item>
	<item>
		<title>By: Stephanie</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-564</link>
		<dc:creator>Stephanie</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-564</guid>
		<description>&lt;p&gt;not sure if this is the best, but it's working! (not sure either how the 
php code will come out with this comments script):&lt;/p&gt;

&lt;p&gt;$referrer_page = 
implode('',file("http://www.disenchanted.com/dis/technology/xanadu.html"));
 preg_match('!&amp;lt;title[^&amp;gt;]*&amp;gt;(.*)&amp;lt;/title&amp;gt;!', $referrer_page, $array_of_matches);
 $referrer_title=$array_of_matches[1];&lt;/p&gt;

&lt;p&gt;Actually, readfile spits the whole thing out on the page without waiting 
for me to tell it to do so.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>not sure if this is the best, but it&#8217;s working! (not sure either how the<br />
php code will come out with this comments script):</p>
<p>$referrer_page =<br />
implode('',file("http://www.disenchanted.com/dis/technology/xanadu.html"));<br />
 preg_match('!&lt;title[^&gt;]*&gt;(.*)&lt;/title&gt;!', $referrer_page, $array_of_matches);<br />
 $referrer_title=$array_of_matches[1];</p>
<p>Actually, readfile spits the whole thing out on the page without waiting<br />
for me to tell it to do so.</p>
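<p>For comparison, here is a minimal sketch of the same title extraction in Python rather than PHP, so it can be run standalone on a string. TITLE_RE and extract_title are hypothetical names, not part of the backlinks script, and it assumes the page has at most one &lt;title&gt; element worth catching.</p>

```python
import re
from typing import Optional

# Case-insensitive, and DOTALL so a title split over several lines still matches.
# The non-greedy (.*?) stops at the first closing tag.
TITLE_RE = re.compile(r"<title[^>]*>(.*?)</title>", re.IGNORECASE | re.DOTALL)


def extract_title(html: str) -> Optional[str]:
    """Return the text between the <title> tags, or None if there is none."""
    match = TITLE_RE.search(html)
    if match is None:
        return None
    # Collapse runs of whitespace left over from line breaks and indentation.
    return " ".join(match.group(1).split())


sample = "<html><head><TITLE>\n  Xanadu  </TITLE></head><body></body></html>"
print(extract_title(sample))
```

<p>Returning None when nothing matches avoids the undefined-index problem that reading $array_of_matches[1] unconditionally would hit on a page without a title.</p>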
]]></content:encoded>
	</item>
	<item>
		<title>By: Morgazilla</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-565</link>
		<dc:creator>Morgazilla</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-565</guid>
		<description>&lt;p&gt;Hello up in the stars, this script looks good, but it doesn't want 
me :). It looks like the title isn't getting through, maybe because of 
the apostrophe? "...J'ai arrete"
 See you another day.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Hello up in the stars, this script looks good, but it doesn&#8217;t want<br />
me :). It looks like the title isn&#8217;t getting through, maybe because of<br />
the apostrophe? &#8220;&#8230;J&#8217;ai arrete&#8221;<br />
 See you another day.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stephanie</title>
		<link>http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-566</link>
		<dc:creator>Stephanie</dc:creator>
		<pubDate>Tue, 30 Nov 1999 00:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://climbtothestars.org/archives/2002/07/09/backlinks/#comment-566</guid>
		<description>&lt;p&gt;Referrers are only displayed after at least two visits from two different 
IPs. Yours is showing now. I'll enter the name in the database manually; 
there was indeed a problem! (The apostrophe or a weird server, 
I don't know...)&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Referrers are only displayed after at least two visits from two different<br />
IPs. Yours is showing now. I&#8217;ll enter the name in the database manually;<br />
there was indeed a problem! (The apostrophe or a weird server,<br />
I don&#8217;t know&#8230;)</p>
]]></content:encoded>
	</item>
</channel>
</rss>
