Entity List Research Articles

Entity-oriented search has revolutionized search engines. In the era of Google Knowledge Graph and Microsoft Satori, users demand an effortless process of search. Whether they express an information need through a keyword query, expecting documents and entities, or through a clicked entity, expecting related entities, there is an inherent need for the combination of corpora and knowledge bases to obtain an answer. Such integration frequently relies on independent signals extracted from inverted indexes, and from quad indexes indirectly accessed through queries to a triplestore. However, relying on two separate representation models inhibits the effective cross-referencing of information, discarding otherwise available relations that could lead to a better ranking. Moreover, different retrieval tasks often demand separate implementations, although the problem is, at its core, the same. With the goal of harnessing all available information to optimize retrieval, we explore joint representation models of documents and entities, while taking a step towards the definition of a more general retrieval approach. Specifically, we propose that graphs should be used to incorporate explicit and implicit information derived from the relations between text found in corpora and entities found in knowledge bases. We also take advantage of this framework to elaborate a general model for entity-oriented search, proposing a universal ranking function for the tasks of ad hoc document retrieval (leveraging entities), ad hoc entity retrieval, and entity list completion. At a conceptual stage, we begin by proposing the graph-of-entity, based on the relations between combinations of term and entity nodes. We introduce the entity weight as the corresponding ranking function, relying on the idea of seed nodes for representing the query, either directly through term nodes, or based on the expansion to adjacent entity nodes. The score is computed based on a series of geodesic distances to the remaining nodes, providing a ranking for the documents (or entities) in the graph. In order to improve on the low scalability of the graph-of-entity, we then redesigned this model in a way that reduced the number of edges in relation to the number of nodes, by relying on the hypergraph data structure. The resulting model, which we called hypergraph-of-entity, is the main contribution of this thesis. The obtained reduction was achieved by replacing binary edges with n -ary relations based on sets of nodes and entities (undirected document hyperedges), sets of entities (undirected hyperedges, either based on cooccurrence or a grouping by semantic subject), and pairs of a set of terms and a set of one entity (directed hyperedges, mapping text to an object). We introduce the random walk score as the corresponding ranking function, relying on the same idea of seed nodes, similar to the entity weight in the graph-of-entity. Scoring based on this function is highly reliant on the structure of the hypergraph, which we call representation-driven retrieval. As such, we explore several extensions of the hypergraph-of-entity, including relations of synonymy, or contextual similarity, as well as different weighting functions per node and hyperedge type. We also propose TF-bins as a discretization for representing term frequency in the hypergraph-of-entity. For the random walk score, we propose and explore several parameters, including length and repeats, with or without seed node expansion, direction, or weights, and with or without a certain degree of node and/or hyperedge fatigue, a concept that we also propose. For evaluation, we took advantage of TREC 2017 OpenSearch track, which relied on an online evaluation process based on the Living Labs API, and we also participated in TREC 2018 Common Core track, which was based on the newly introduced TREC Washington Post Corpus. Our main experiments were supported on the INEX 2009 Wikipedia collection, which proved to be a fundamental test collection for assessing retrieval effectiveness across multiple tasks. At first, our experiments solely focused on ad hoc document retrieval, ensuring that the model performed adequately for a classical task. We then expanded the work to cover all three entity-oriented search tasks. Results supported the viability of a general retrieval model, opening novel challenges in information retrieval, and proposing a new path towards generality in this area.

Read full abstract

U.S.-China Economic Tensions—Will Biden Get Right What Trump Got Wrong? Yukon Huang (bio) Although President Biden has vowed to reverse many of Trump's policies, both administrations see China as a strategic threat and great-power rival. This reflects popular sentiments expressed in various polls that China has become an "overwhelming geopolitical concern."1 Biden has characterized the U.S.-China confrontation as "a battle between the utility of democracies in the twenty-first century and autocracies."2 At the same time, Biden wants to avoid a total collapse in U.S.-China relations since China is a partner as well as competitor and rival, depending on the issue. If tensions are inevitable, then Biden's challenge is to differentiate between real issues where progress is desired and several misguided concerns that absorbed Trump's attention. The Trump administration's misguided concerns The Trump administration's first mistake was failing to recognize that trade deficits are not the central problem. President Trump's trade war with China was fueled by his belief that China was responsible for the United States' huge trade deficits, which contributed to lost manufacturing jobs and reduced competitiveness.3 However, trade deficits are not a good indicator of the state of the economy.4 For example, when an economy is doing well, increasing household incomes result in more imports and larger trade deficits. Furthermore, the United States has been running trade deficits for over forty years, long before China became a major economic power and exporter. In other words, the trade balances of the United States and China are not linked. When U.S. trade deficits soared in the late 1990s and early 2000s, China was not running significant trade surpluses. Later, when China's surpluses rose sharply, U.S. deficits declined. U.S. trade balances are largely driven by budget deficits and China's balances by rising household savings rates with urbanization—factors that have little to do with one another.5 Even if the goal had been to reduce the bilateral trade imbalance, the Trump administration's policy would still have made little sense. China cannot buy enough from the United States to bridge the trade deficit, in part because the latter does not produce enough of the high-end consumer goods or the raw materials that the former desires. Instead, Europe supplies China with much of its high-end consumer goods while Latin America and Africa provide much of its raw materials. Moreover, U.S. restrictions prevent sales of the high-tech products China wants due to reasons of national security. Such restrictions may be strategically aimed, but their impact on trade imbalances should not be underestimated.6 Sales of military equipment generated $175 billion for the United States in 2020.7 Aside from the understandable banning of military sales to China, U.S. export earnings are reduced by tightening licensing requirements imposed on China's purchases of hi-tech products for civilian use. In addition, more than 300 Chinese companies have been added over the past year to the Commerce Department's Entity List, which further restricts access to U.S. hi-tech products.8 Cutting off sales to Chinese firms like Huawei, ZTE, and other industrial leaders not only threatens their operational existence, but also represents substantial lost revenues for American suppliers. The U.S. Semiconductor Association, [End Page 246] for example, estimates losses of $30–50 billion if such restrictions are fully implemented.9 Therefore, it is no surprise that Trump's Phase One Trade Agreement committing China to buy more failed to achieve its desired effect of boosting U.S. exports to China and lowering its overall trade deficits.10 Second, Trump echoed popular but misguided sentiments that U.S. firms have been investing too much in China at the expense of the U.S. economy. Over the past two decades, only 1–2 percent of U.S. foreign investment has been going to China.11 By contrast, the European Union (EU), which is comparable to the United States in economic size, has been investing roughly twice as much. So the question is why the United States invests so little in China rather than so much...

Read full abstract

Entity List Research Articles

Related Topics

Articles published on Entity List

Huawei Is Quietly Dominating China’s Semiconductor Supply Chain

SF-GPT: A training-free method to enhance capabilities for knowledge graph construction in LLMs

Mechanism of strategic remoulding of high-dimensional industrial eco-economy and the reconstruction of carbon neutral industrial chain

Does the U.S.-China trade war stop? A novel event study on fake news and stock price in China

Does being included in an entity list enhance regulated firms’ mergers and acquisitions? Evidence from Chinese high-tech firms

Export controls and innovation performance: Unravelling the complex relationship between blacklisted Chinese firms and U.S. suppliers

The United States–China ‘tech war’: Decoupling and the case of Huawei

A Study on the Game Strategy of Chip Price Behavior at the Background of the US-China Trade War

Chronology of Practice: Chinese Practice in Private International Law in 2022

The Impact of Sino-US Trade Friction on China's Manufacturing Industry

Mapping U.S.-China technological “decoupling”: Beyond U.S.-China relations

US economic statecraft and great power competition

COVID-19, business continuity management and standardization: case study of Huawei

Between a Rock and a Hard Place Under China’s Anti-Sanction Law 2021: The Game- Theoretical Perspective

Chronology of Practice: Chinese Practice in Private International Law in 2020

Graph-based entity-oriented search

U.S.-China Economic Tensions—Will Biden Get Right What Trump Got Wrong?

China’s Counter-Strategy to American Export Controls in Integrated Circuits

Agglomerative clusterization with DBSCAN algorithm and iterative method

IS A DIGITAL GREAT WALL BEING BUILT BETWEEN THE U.S. AND CHINA?

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Entity List Research Articles

Related Topics

Articles published on Entity List

Huawei Is Quietly Dominating China’s Semiconductor Supply Chain

SF-GPT: A training-free method to enhance capabilities for knowledge graph construction in LLMs

Mechanism of strategic remoulding of high-dimensional industrial eco-economy and the reconstruction of carbon neutral industrial chain

Does the U.S.-China trade war stop? A novel event study on fake news and stock price in China

Does being included in an entity list enhance regulated firms’ mergers and acquisitions? Evidence from Chinese high-tech firms

Export controls and innovation performance: Unravelling the complex relationship between blacklisted Chinese firms and U.S. suppliers

The United States–China ‘tech war’: Decoupling and the case of Huawei

A Study on the Game Strategy of Chip Price Behavior at the Background of the US-China Trade War

Chronology of Practice: Chinese Practice in Private International Law in 2022

The Impact of Sino-US Trade Friction on China's Manufacturing Industry

Mapping U.S.-China technological “decoupling”: Beyond U.S.-China relations

US economic statecraft and great power competition

COVID-19, business continuity management and standardization: case study of Huawei

Between a Rock and a Hard Place Under China’s Anti-Sanction Law 2021: The Game- Theoretical Perspective

Chronology of Practice: Chinese Practice in Private International Law in 2020

Graph-based entity-oriented search

U.S.-China Economic Tensions—Will Biden Get Right What Trump Got Wrong?

China’s Counter-Strategy to American Export Controls in Integrated Circuits

Agglomerative clusterization with DBSCAN algorithm and iterative method

IS A DIGITAL GREAT WALL BEING BUILT BETWEEN THE U.S. AND CHINA?