tkinter slider (PyKHTML, a Python interface to KHTML)

Time: 2025-05-24 05:44:45 | Category: IT & Technology | Views: 7388
PyKHTML is...

A Python module for writing website scrapers/spiders. Whereas traditional methods focus on writing the code to parse HTML/forms themselves, PyKHTML uses the excellent KHTML engine to do all the drudge work. It therefore handles webpages very well (even the severely crufty ones) and is pretty darn fast (the engine is implemented in C++). As a bonus the module handles JavaScript and cookies transparently. Hurrah!
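
For comparison, here is a rough sketch of the "traditional" hand-written parsing that the paragraph above contrasts PyKHTML with. It uses only the standard library's html.parser and is not PyKHTML code; the class name TitleAndLinkParser and the sample page are made up for illustration. Note that this approach sees only the static markup: it runs no JavaScript and keeps no cookies.

```python
from html.parser import HTMLParser


class TitleAndLinkParser(HTMLParser):
    """Collects the <title> text and every <a href> on a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []            # list of (href, link text) pairs
        self._in_title = False
        self._current_href = None  # href of the <a> currently open, if any
        self._current_text = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self._current_href = href
                self._current_text = []

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._current_href is not None:
            self._current_text.append(data)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag == "a" and self._current_href is not None:
            text = "".join(self._current_text).strip()
            self.links.append((self._current_href, text))
            self._current_href = None


# Made-up sample page; a real scraper would fetch the HTML over HTTP.
SAMPLE_HTML = """
<html><head><title>Example page</title></head>
<body><ul>
  <li><a href="/download">Download</a></li>
  <li><a href="/docs">Documentation</a></li>
</ul></body></html>
"""

parser = TitleAndLinkParser()
parser.feed(SAMPLE_HTML)
print(parser.title)
print(parser.links)
```

Even for this tiny page the parser has to track its own state by hand, which is the kind of work the module hands off to the browser engine.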

How?

PyKHTML requires PyKDE 3 (and hence in turn PyQt 3 + KDE libs). If you would like to run PyKHTML on servers without an X display then Xvfb is required. Fortunately these requirements should come bundled with most modern Linux distributions, and support for Windows/Mac should appear in the next few months.
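
As a sketch of the headless setup mentioned above, the helper below wraps a script in xvfb-run when no X display is available. It assumes the xvfb-run wrapper script that many distributions ship alongside Xvfb is installed; the function names and the script name scraper.py are hypothetical, and nothing here is part of PyKHTML itself.

```python
import os
import shutil
import subprocess
import sys


def build_command(script, env=None):
    """Return the argv to run `script`, wrapped in xvfb-run when DISPLAY is unset."""
    env = os.environ if env is None else env
    command = [sys.executable, script]
    if not env.get("DISPLAY"):
        # -a asks xvfb-run to pick a free display number automatically.
        command = ["xvfb-run", "-a"] + command
    return command


def run_headless(script):
    """Run `script`, failing early if Xvfb is needed but xvfb-run is missing."""
    command = build_command(script)
    if command[0] == "xvfb-run" and shutil.which("xvfb-run") is None:
        raise RuntimeError("xvfb-run not found; install Xvfb or set DISPLAY")
    return subprocess.call(command)
```

Calling run_headless("scraper.py") on a server without a display would then start the (hypothetical) scraper under a virtual framebuffer, while on a desktop session it runs the script directly.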

Show me some code

Okay. Here is an example (one of many examples included in the bundle) that scrapes the title and navigation from this page, with excessive commenting to give you a feel of what programming with PyKHTML is like:

Note of Thanks

Gambit Research, a software company in West London, sponsor PyKHTML development.
