snipt

Ctrl+h for KB shortcuts

PHP

Fetch/scrape parts of an html page

<?php
// Fetch <div id="foobar"> from a webpage
$html = new DOMDocument(); 
$html->loadHtmlFile('http://example.com/some_page.html'); 
 
$xpath = new DOMXPath($html); 
$items = $xpath->query("//div[@id='foobar']"); // replace with your query

$dom = new DOMDocument;
foreach ($items as $item) {
    $dom->appendChild($dom->importNode($item,true));
}

// Output to screen
echo $dom->saveHTML();
https://snipt.net/embed/be2e34567698453a7f4229ac8fb0d373/
/raw/be2e34567698453a7f4229ac8fb0d373/
be2e34567698453a7f4229ac8fb0d373
php
PHP
15
2019-06-26T16:20:49
True
False
False
/api/public/snipt/25290/
fetch-parts-of-a-html-page
<table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><a href="#L-1"> 1</a> <a href="#L-2"> 2</a> <a href="#L-3"> 3</a> <a href="#L-4"> 4</a> <a href="#L-5"> 5</a> <a href="#L-6"> 6</a> <a href="#L-7"> 7</a> <a href="#L-8"> 8</a> <a href="#L-9"> 9</a> <a href="#L-10">10</a> <a href="#L-11">11</a> <a href="#L-12">12</a> <a href="#L-13">13</a> <a href="#L-14">14</a> <a href="#L-15">15</a></pre></div></td><td class="code"><div class="highlight"><pre><span></span><span id="L-1"><a name="L-1"></a><span class="cp">&lt;?php</span> </span><span id="L-2"><a name="L-2"></a><span class="c1">// Fetch &lt;div id=&quot;foobar&quot;&gt; from a webpage</span> </span><span id="L-3"><a name="L-3"></a><span class="nv">$html</span> <span class="o">=</span> <span class="k">new</span> <span class="nx">DOMDocument</span><span class="p">();</span> </span><span id="L-4"><a name="L-4"></a><span class="nv">$html</span><span class="o">-&gt;</span><span class="na">loadHtmlFile</span><span class="p">(</span><span class="s1">&#39;http://example.com/some_page.html&#39;</span><span class="p">);</span> </span><span id="L-5"><a name="L-5"></a> </span><span id="L-6"><a name="L-6"></a><span class="nv">$xpath</span> <span class="o">=</span> <span class="k">new</span> <span class="nx">DOMXPath</span><span class="p">(</span><span class="nv">$html</span><span class="p">);</span> </span><span id="L-7"><a name="L-7"></a><span class="nv">$items</span> <span class="o">=</span> <span class="nv">$xpath</span><span class="o">-&gt;</span><span class="na">query</span><span class="p">(</span><span class="s2">&quot;//div[@id=&#39;foobar&#39;]&quot;</span><span class="p">);</span> <span class="c1">// replace with your query</span> </span><span id="L-8"><a name="L-8"></a> </span><span id="L-9"><a name="L-9"></a><span class="nv">$dom</span> <span class="o">=</span> <span class="k">new</span> <span class="nx">DOMDocument</span><span class="p">;</span> </span><span id="L-10"><a name="L-10"></a><span class="k">foreach</span> <span class="p">(</span><span class="nv">$items</span> <span class="k">as</span> <span class="nv">$item</span><span class="p">)</span> <span class="p">{</span> </span><span id="L-11"><a name="L-11"></a> <span class="nv">$dom</span><span class="o">-&gt;</span><span class="na">appendChild</span><span class="p">(</span><span class="nv">$dom</span><span class="o">-&gt;</span><span class="na">importNode</span><span class="p">(</span><span class="nv">$item</span><span class="p">,</span><span class="k">true</span><span class="p">));</span> </span><span id="L-12"><a name="L-12"></a><span class="p">}</span> </span><span id="L-13"><a name="L-13"></a> </span><span id="L-14"><a name="L-14"></a><span class="c1">// Output to screen</span> </span><span id="L-15"><a name="L-15"></a><span class="k">echo</span> <span class="nv">$dom</span><span class="o">-&gt;</span><span class="na">saveHTML</span><span class="p">();</span> </span></pre></div> </td></tr></table>
dom, domxpath, html, php