Scilab Online Help
6.0.2

htmlRead

Read an HTML stream from a local or remote file

Syntax

doc = htmlRead(path [, encoding])

Arguments

path

a string, the path or URL of the file to read.

encoding

a string, the file encoding.

doc

an mlist of type XMLDoc

Description

Reads and parses an HTML file. The returned document gives access to the DOM tree, which is kept in memory.

In general, an HTML file cannot be read with an XML parser because HTML is rarely well-formed, so a more tolerant HTML parser is required.

Once the HTML file is parsed, it is represented in memory like an XML document, so the usual XML operations (for example xmlXPath) can be applied to it.

The encoding argument specifies the encoding of the file.

The tree must be freed with the function xmlDelete once it is no longer needed, to avoid memory leaks.
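
The points above (tolerant parsing, the optional encoding argument, and freeing the tree) can be sketched without network access as follows. This is only an illustrative sketch: the file name htmlread_demo.html and the HTML content are hypothetical, and the "UTF-8" encoding name is assumed to be accepted by the underlying parser.

```scilab
// Write a small, deliberately sloppy HTML file (unclosed <p> and <br>).
// The file name is hypothetical; TMPDIR is Scilab's temporary directory.
path = fullfile(TMPDIR, "htmlread_demo.html");
html = ["<html><body>"
        "<p>First paragraph<br>"
        "<a href=""https://www.scilab.org"">Scilab</a>"
        "</body></html>"];
mputl(html, path);

// Parse the file, stating its encoding explicitly (assumed name: "UTF-8")
doc = htmlRead(path, "UTF-8");
try
    // Once parsed, the document can be queried like an XML document
    links = xmlXPath(doc, "//a[@href]");
    n = links.size;
    target = links(1).attributes.href;
catch
    // Free the tree even if the query fails, then report the error
    xmlDelete(doc);
    error(lasterror());
end

// Free the tree to avoid a memory leak
xmlDelete(doc);
mprintf("%d link(s) found, first target: %s\n", n, target);
```

Values such as n and target are copied out of the document before xmlDelete is called, since nodes belonging to a deleted document should no longer be used.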

Examples

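// Read and parse the page (requires network access)
doc = htmlRead("http://www.scilab.org");
// Take the first child of the first child of the root element
e = doc.root.children(1).children(1)
e.name
e.content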

// Now we can retrieve all the nodes with an 'href' attribute
q = xmlXPath(doc, "//*[@href]");
q(1).attributes

// We delete the doc
xmlDelete(doc);

See also

History

Version   Description
5.5.0     HTML features added.

Copyright (c) 2022-2023 (Dassault Systèmes)
Copyright (c) 2017-2022 (ESI Group)
Copyright (c) 2011-2017 (Scilab Enterprises)
Copyright (c) 1989-2012 (INRIA)
Copyright (c) 1989-2007 (ENPC)
with contributors
Last updated:
Thu Feb 14 15:05:02 CET 2019