Python/Spider: Difference between revisions

From Fundamental Ramen
Jump to navigation Jump to search
No edit summary
No edit summary
 
Line 1: Line 1:
=== Soup ===
== Soup ==
<source lang="python">
<source lang="python">
</source>
</source>


=== Download ===
== Download ==
<source lang="python">
<source lang="python">
def download_file(url):
def download_file(url):

Latest revision as of 03:46, 6 August 2019

Soup

Download

def download_file(url):
    local_filename = url.split('/')[-1]
    # NOTE the stream=True parameter below
    with requests.get(url, stream=True) as r:
        r.raise_for_status()
        with open(local_filename, 'wb') as f:
            for chunk in r.iter_content(chunk_size=8192): 
                if chunk: # filter out keep-alive new chunks
                    f.write(chunk)
                    # f.flush()
    return local_filename