Empirical Best Practices On Using Product-Specific Schema.org

Mayank Kejriwal,Ravi Kiran Selvam,Chien-Chun Ni,Nicolas Torzec

doi:10.1609/aaai.v35i17.17816

Abstract

Schema.org has experienced high growth in recent years. Structured descriptions of products embedded in HTML pages are now not uncommon, especially on e-commerce websites. The Web Data Commons (WDC) project has extracted schema.org data at scale from webpages in the Common Crawl and made it available as an RDF `knowledge graph' at scale. The portion of this data that specifically describes products offers a golden opportunity for researchers and small companies to leverage it for analytics and downstream applications. Yet, because of the broad and expansive scope of this data, it is not evident whether the data is usable in its raw form. In this paper, we do a detailed empirical study on the product-specific schema.org data made available by WDC. Rather than simple analysis, the goal of our study is to devise an empirically grounded set of best practices for using and consuming WDC product-specific schema.org data. Our studies reveal five best practices, each of which is justified by experimental data and analysis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Empirical Best Practices On Using Product-Specific Schema.org

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 1

Similar Papers

The Web Data Commons Structured Data Extraction
...
-
, et. al. ...
17 Mar 2017
17 Mar 2017

Inducing Schema.org markup from Natural Language Context
Gautam Kishore Shahi ... Sushma Kumari
-
Gautam Kishore Shahi, et. al.Gautam Kishore Shahi ... Sushma Kumari
25 May 2019
25 May 2019

Best practice fusion of CMMI-DEV v1.2 (PP, PMC, SAM) and PMBOK 2008
Christiane Gresse Von Wangenheim ... Rafael Prikladnicki
Information and Software Technology | VOL. 52
Christiane Gresse Von Wangenheim, et. al.Christiane Gresse Von Wangenheim ... Rafael Prikladnicki
29 Mar 2010
Best practice fusion of CMMI-DEV v1.2 (PP, PMC, SAM) and PMBOK 2008
Christiane Gresse Von Wangenheim ... Rafael Prikladnicki

AlzPED: Optimizing the predictive power of drug efficacy studies in Alzheimer’s disease animal models
Shreaya Chakroborty ... Zane Martin
Alzheimer's & Dementia | VOL. 17
Shreaya Chakroborty, et. al.Shreaya Chakroborty ... Zane Martin
01 Dec 2021
Alzheimer's & Dementia | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Empirical Best Practices On Using Product-Specific Schema.org

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence