How to Extract Script and CSS Files from Web Pages in Python

作者

2023年08月22日

更新时间

13.15 分钟

阅读时间

阅读量

To extract script and CSS files from a web page in Python, you can use the BeautifulSoup library along with the requests library to send an HTTP request to the web page and parse its HTML content. Here’s an example Python code that demonstrates how to extract script and CSS URLs from a web page:

import requests
from bs4 import BeautifulSoup

url = 'https://www.example.com/page.html'

response = requests.get(url)

if response.status_code == 200:
    soup = BeautifulSoup(response.content, 'html.parser')

    # Extract script and CSS URLs
    script_urls = [script['src'] for script in soup.find_all('script', src=True)]
    css_urls = [link['href'] for link in soup.find_all('link', rel='stylesheet')]

    print('Scripts:')
    print('\n'.join(script_urls))
    print('')
    print('CSS:')
    print('\n'.join(css_urls))
else:
    print('Request failed')

In this code, we first send an HTTP GET request to the web page and use the BeautifulSoup library to parse its HTML content. We then extract the URLs of all script and CSS files by searching for the script and link tags with the appropriate attributes using find_all() method.

Note that the scripts and CSS URLs might be relative URLs, so you may need to use urllib.parse.urljoin() method to create an absolute URL from them.

How to Extract Script and CSS Files from Web Pages in Python

How to Generate Random Data in Python

How to download a remote image and save it to your local ma…

博客作者

GLM 是真敢删啊？！说好的 P0 安全规范呢？

如果要投票一个最弱智的ai模型一定是千问

告别手动拼接：PromptForge 如何重新定义你的 AI 工作流

Privacy Policy for TerryVoiceRead Chrome Extension

告别龟速！NAS迅雷内测体验，速度起飞，附邀请码！