2022
DOI: 10.3390/app12167968

Informative Language Encoding by Variational Autoencoders Using Transformer

Abstract: In natural language processing (NLP), the Transformer is widely used and has reached the state-of-the-art level in numerous NLP tasks such as language modeling, summarization, and classification. Moreover, a variational autoencoder (VAE) is an efficient generative model in representation learning, combining deep learning with statistical inference in encoded representations. However, the use of VAEs in natural language processing often brings forth practical difficulties such as posterior collapse, also known as …
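For readers unfamiliar with the term, posterior collapse can be stated directly in terms of the standard VAE objective. The following is a generic sketch in the usual VAE notation, not an equation taken from the paper itself:

    \mathcal{L}_{\mathrm{ELBO}}(x) = \mathbb{E}_{q_\phi(z \mid x)}\left[\log p_\theta(x \mid z)\right] - D_{\mathrm{KL}}\left(q_\phi(z \mid x) \,\|\, p(z)\right)

Posterior collapse occurs when the approximate posterior q_\phi(z|x) degenerates to the prior p(z) for almost all inputs, so the KL term is driven to zero and the decoder learns to ignore the latent code z. This is the failure mode that makes VAEs difficult to train on text with strong autoregressive decoders.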

Cited by 5 publications (4 citation statements). References 24 publications (65 reference statements).
“…However, when the input time series is too long, the LSTM network also encounters problems such as long training times, slow parameter updates, and even vanishing gradients (Hochreiter, 1991; Dandwate et al., 2023). The LSTM then needs more time to train on a long sequence, and when backpropagation is used to update the model parameters, a longer sequence causes the gradient to shrink toward zero, so the parameters can no longer be updated; the key information from earlier time steps and from the front of the long sequence is lost, i.e., long-term dependency problems cannot be handled (Ok et al., 2022; Mercan et al., 2023). For large LSTM models, a reasonable input sequence length is between 100 and 500 (Xiao et al., 2019).…”
Section: The Proposed Modulation Classification Methods
confidence: 99%
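As a generic illustration of the long-term dependency problem described in the statement above (this derivation is standard and not taken from the cited works), the gradient that backpropagation sends from the loss at step T to an early hidden state h_k is a product of step-to-step Jacobians:

    \frac{\partial \mathcal{L}_T}{\partial h_k} = \frac{\partial \mathcal{L}_T}{\partial h_T} \prod_{t=k+1}^{T} \frac{\partial h_t}{\partial h_{t-1}},
    \qquad \left\| \prod_{t=k+1}^{T} \frac{\partial h_t}{\partial h_{t-1}} \right\| \le \gamma^{\,T-k} \quad \text{if } \left\| \frac{\partial h_t}{\partial h_{t-1}} \right\| \le \gamma .

For \gamma < 1 the bound decays exponentially in the gap T - k; for example, \gamma = 0.9 over a gap of 500 steps gives 0.9^{500} \approx 10^{-23}, so parameters that influence the loss only through early time steps receive essentially no update. This is consistent with the quoted recommendation of input lengths on the order of 100-500.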
“…Biesner et al. [44] integrate the token vectors of each word from a Transformer encoder using an RNN to obtain a single latent variable. On the other hand, Ok et al. [45] encode the token vector of the sentence representation into a latent variable through a simple linear layer.…”
Section: Transformer-based VAE
confidence: 99%
“…Biesner et al. [41] integrate the token vectors of each word from a Transformer encoder using an RNN to obtain a single latent variable. On the other hand, Ok et al. [42] encode the token vector of the sentence representation into a latent variable through a simple linear layer. Since sequential processing with RNNs undermines the advantage of parallel processing offered by the Transformer, our model is based on the approach of Ok et al. to construct the VAE model.…”
Section: Transformer-based VAE
confidence: 99%
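To make the contrast in the statements above concrete, here is a minimal PyTorch-style sketch of the second approach: a sentence-level Transformer token vector is projected to the parameters of a Gaussian latent by simple linear layers and then reparameterized. The class name, dimensions, and the use of two separate projections for mean and log-variance are illustrative assumptions, not the code of Ok et al. or Biesner et al.

    import torch
    import torch.nn as nn

    class SentenceLatentHead(nn.Module):
        """Hypothetical sketch: map one sentence-level Transformer token vector
        (e.g., a pooled or [CLS]-style representation) to a Gaussian latent."""
        def __init__(self, d_model=768, d_latent=32):
            super().__init__()
            self.to_mu = nn.Linear(d_model, d_latent)       # mean of q(z|x)
            self.to_logvar = nn.Linear(d_model, d_latent)   # log-variance of q(z|x)

        def forward(self, sentence_vec):                    # sentence_vec: (batch, d_model)
            mu = self.to_mu(sentence_vec)
            logvar = self.to_logvar(sentence_vec)
            std = torch.exp(0.5 * logvar)
            z = mu + std * torch.randn_like(std)             # reparameterization trick
            kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1)
            return z, kl                                     # z feeds the decoder; kl joins the loss

Because the projection acts on a single pooled vector rather than running an RNN over all token vectors, every sentence in a batch is handled by one matrix multiplication, preserving the Transformer's parallelism; that is the design consideration the quoted passage gives for preferring the approach of Ok et al. over the RNN aggregation of Biesner et al.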