![]() System : Linux absol.cf 5.4.0-198-generic #218-Ubuntu SMP Fri Sep 27 20:18:53 UTC 2024 x86_64 User : www-data ( 33) PHP Version : 7.4.33 Disable Function : pcntl_alarm,pcntl_fork,pcntl_waitpid,pcntl_wait,pcntl_wifexited,pcntl_wifstopped,pcntl_wifsignaled,pcntl_wifcontinued,pcntl_wexitstatus,pcntl_wtermsig,pcntl_wstopsig,pcntl_signal,pcntl_signal_get_handler,pcntl_signal_dispatch,pcntl_get_last_error,pcntl_strerror,pcntl_sigprocmask,pcntl_sigwaitinfo,pcntl_sigtimedwait,pcntl_exec,pcntl_getpriority,pcntl_setpriority,pcntl_async_signals,pcntl_unshare, Directory : /usr/share/doc/libhtml-parser-perl/examples/ |
Upload File : |
For most of these scripts if you run them with a file argument, where the file contains some HTML, you should get some output. The 'h*sub' scripts take two arguments the first of which is a perl expression and the second an HTML file. In any case all of the files have an exlanatory comment. For example try running: lynx -dump -source -raw http://www.debian.org > /tmp/a.txt ./hanchors /tmp/a.txt Of course if http://www.debian.org is not your favourite web site you can make the appropriate substitution. hanchors - List all anchors in the HTML hlc - Correct any upper case tags to lower case hstrip - Removes deprecated scripting and styling tags and attributes htextsub - Apply arbirary perl expression to all text within HTML hrefsub - Apply arbirary perl expression to all hrefs within HTML htitle - Print title of the HTML document hdump - Output event information whilst parsing HTML document hform - Print analysis of form controls present in HTML htext - Print all the text from the HTML