Pair Encoding Research Articles

Morphologically rich and complex languages such as Arabic, pose a major challenge to neural machine translation (NMT) due to the large number of rare words and the inability of NMT to translate them. Unknown word (UNK) symbols are used to represent out-of-vocabulary words because NMT typically operates with a fixed vocabulary size. These rare words can be effectively encoded as sequences of subword units by using algorithms, such as byte pair encoding (BPE), to tackle the UNK problem. However, for languages with highly inflected and morphological variations, such as Arabic, the aforementioned method has its own limitations that make it not effective enough for translation quality. To alleviate the UNK problem and address the inconvenient behavior of BPE when translating the Arabic language, we propose to utilize a romanization system that converts Arabic scripts to subword units. We investigate the effect of our approach on NMT performance under various segmentation scenarios and compare the results with systems trained on original Arabic form. In addition, we integrate Romanized Arabic as an input factor for Arabic-sourced NMT compared with well-known factors, namely, lemma, part-of-speech tags, and morph features. Extensive experiments on Arabic-Chinese translation demonstrate that the proposed approaches can effectively tackle the UNK problem and significantly improve the translation quality for Arabic-sourced translation. Additional experiments in this study focus on developing the NMT system on Chinese-Arabic translation. Before implementing our experiments, we first propose standard criteria for the data filtering of a parallel corpus, which helps in filtering out its noise.

Read full abstract

Fog Computing, a technology that takes advantage of both the paradigms of Cloud Computing and the Internet of Things, has a great advantage in reducing the communication cost. Since its introduction, fog computing has found a lot of applications, including, for instance, connected vehicles, wireless sensors, smart cities and etc. One prominent problem in fog computing is how fine-grained access control can be imposed. Functional encryption, a new cryptographic primitive, is known to support fine-grained access control. However, when it comes to some new threats in the fog computing scenario, such as side channel attacks, functional encryption cannot maintain its security. Therefore, we need new cryptographic primitives that not only provide a way to securely share data with a fine-grained access control but also are able to resist those new threats.In this paper, we consider how to construct functional encryption schemes (FEs) adaptively secure in continual memory leakage model (CML), which is one of the strongest models that allows continuous leakage on both user and master secret keys. Besides providing privacy and fine-grained access control in fog computing, our scheme can also guarantee security against side channel attacks. More concretely, we propose a generic framework for constructing fully secure leakage-resilient FEs (LR-FEs) in the CML model results from leakage-resilient pair encoding, which is an extension of pair encoding presented in the recent work of Attrapadung. In this way, our framework simplifies the design and analysis of LR-FEs into the design and analysis of predicate encodings. Moreover, we discover new adaptively secure LR-FEs, including FE for regular languages, attribute-based encryption (ABE) for large universe and ABE with short ciphertext. Above all, leakage-resilient adaptively secure functional encryption schemes can equip fog computing with higher security and fine-grained access control.

Read full abstract

Pair Encoding Research Articles

Related Topics

Articles published on Pair Encoding

Open Vocabulary Arabic Diacritics Restoration

Predicate signatures from pair encodings via dual system proof technique

Arabic–Chinese Neural Machine Translation: Romanized Arabic as Subword Unit for Arabic-sourced Translation

Hierarchical Transfer Learning Architecture for Low-Resource Neural Machine Translation

Four-level phase pair encoding and decoding with single interferometric phase retrieval for holographic data storage

Towards leakage-resilient fine-grained access control in fog computing

Functional encryption for computational hiding in prime order groups via pair encodings

Byte Pair Transformation using Zero-Frequency Bytes with Varying Number of Passes

A new method of fast compression of program code for ota updates in consumer devices

Transcriptional Analysis and Functional Characterization of a Gene Pair Encoding Iron-Regulated Xenocin and Immunity Proteins of Xenorhabdus nematophila

Associative and strategic components of episodic memory: A life-span dissociation.

Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions

Associative Memory Encoding and Recognition in Schizophrenia: An Event-Related fMRI Study

String Matching Over Compressed Text on Handheld Devices Using Tagged Sub-Optimal Code (TSC)

Vector quantization of speech line spectrum pair parameters and reflection coefficients

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Pair Encoding Research Articles

Related Topics

Articles published on Pair Encoding

Open Vocabulary Arabic Diacritics Restoration

Predicate signatures from pair encodings via dual system proof technique

Arabic–Chinese Neural Machine Translation: Romanized Arabic as Subword Unit for Arabic-sourced Translation

Hierarchical Transfer Learning Architecture for Low-Resource Neural Machine Translation

Four-level phase pair encoding and decoding with single interferometric phase retrieval for holographic data storage

Towards leakage-resilient fine-grained access control in fog computing

Functional encryption for computational hiding in prime order groups via pair encodings

Byte Pair Transformation using Zero-Frequency Bytes with Varying Number of Passes

A new method of fast compression of program code for ota updates in consumer devices

Transcriptional Analysis and Functional Characterization of a Gene Pair Encoding Iron-Regulated Xenocin and Immunity Proteins of Xenorhabdus nematophila

Associative and strategic components of episodic memory: A life-span dissociation.

Speeding Up HMM Decoding and Training by Exploiting Sequence Repetitions

Associative Memory Encoding and Recognition in Schizophrenia: An Event-Related fMRI Study

String Matching Over Compressed Text on Handheld Devices Using Tagged Sub-Optimal Code (TSC)

Vector quantization of speech line spectrum pair parameters and reflection coefficients