Managing Web-based data - database models and transformations
Abstract
The paper considers the Araneus data model, which employs database techniques and wrappers both to extract data from Web sites and to generate them. The project features a logical model that abstracts away the physical aspects of Web sites. Araneus provides high-level descriptions of pages that make it possible both to extract data from the Web and to generate Web sites from databases.
Keywords
#Data mining #Data models #HTML #Spatial databases #Technology management #Internet #Information analysis #Humans #XML #Relational databases