Advanced search in
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
2,016 Research products, page 1 of 202

  • Publications
  • Research data
  • Other research products
  • AE
  • arXiv.org e-Print Archive

10
arrow_drop_down
Date (most recent)
arrow_drop_down
  • Open Access English
    Authors: 
    Nadia Figueroa; Haiwei Dong; Abdulmotaleb El Saddik;
    Country: Switzerland

    We propose a 6D RGB-D odometry approach that finds the relative camera pose between consecutive RGB-D frames by keypoint extraction and feature matching both on the RGB and depth image planes. Furthermore, we feed the estimated pose to the highly accurate KinectFusion algorithm, which uses a fast ICP (Iterative Closest Point) to fine-tune the frame-to-frame relative pose and fuse the depth data into a global implicit surface. We evaluate our method on a publicly available RGB-D SLAM benchmark dataset by Sturm et al. The experimental results show that our proposed reconstruction method solely based on visual odometry and KinectFusion outperforms the state-of-the-art RGB-D SLAM system accuracy. Moreover, our algorithm outputs a ready-to-use polygon mesh (highly suitable for creating 3D virtual worlds) without any postprocessing steps.

  • Publication . Preprint . Article . 2022
    Open Access
    Authors: 
    Mingbao Lin; Rongrong Ji; Zihan Xu; Baochang Zhang; Fei Chao; Chia-Wen Lin; Ling Shao;
    Publisher: Institute of Electrical and Electronics Engineers (IEEE)

    Binary neural networks (BNNs) have attracted broad research interest due to their efficient storage and computational ability. Nevertheless, a significant challenge of BNNs lies in handling discrete constraints while ensuring bit entropy maximization, which typically makes their weight optimization very difficult. Existing methods relax the learning using the sign function, which simply encodes positive weights into +1s, and -1s otherwise. Alternatively, we formulate an angle alignment objective to constrain the weight binarization to {0,+1} to solve the challenge. In this paper, we show that our weight binarization provides an analytical solution by encoding high-magnitude weights into +1s, and 0s otherwise. Therefore, a high-quality discrete solution is established in a computationally efficient manner without the sign function. We prove that the learned weights of binarized networks roughly follow a Laplacian distribution that does not allow entropy maximization, and further demonstrate that it can be effectively solved by simply removing the $\ell_2$ regularization during network training. Our method, dubbed sign-to-magnitude network binarization (SiMaN), is evaluated on CIFAR-10 and ImageNet, demonstrating its superiority over the sign-based state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/SiMaN. Comment: Accepted by IEEE TPAMI, 2022

  • Publication . Article . Other literature type . Preprint . 2022 . Embargo End Date: 01 Jan 2021
    Open Access
    Authors: 
    J. van den Eijnden; Ian Heywood; Rob Fender; Shazrene Mohamed; G. R. Sivakoff; Payaswini Saikia; Thomas D. Russell; Sara Motta; James Miller-Jones; Patrick A. Woudt;
    Publisher: arXiv
    Country: United Kingdom
    Project: NSERC

    Vela X-1 is a runaway X-ray binary system hosting a massive donor star, whose strong stellar wind creates a bow shock as it interacts with the interstellar medium. This bow shock has previously been detected in H$\alpha$ and IR, but, similar to all but one bow shock from a massive runaway star (BD+43$^{\rm o}$3654), has escaped detection in other wavebands. We report on the discovery of $1.3$ GHz radio emission from the Vela X-1 bow shock with the MeerKAT telescope. The MeerKAT observations reveal how the radio emission closely traces the H$\alpha$ line emission, both in the bow shock and in the larger-scale diffuse structures known from existing H$\alpha$ surveys. The Vela X-1 bow shock is the first stellar-wind-driven radio bow shock detected around an X-ray binary. In the absence of a radio spectral index measurement, we explore other avenues to constrain the radio emission mechanism. We find that thermal/free-free emission can account for the radio and H$\alpha$ properties, for a combination of electron temperature and density consistent with earlier estimates of ISM density and the shock enhancement. In this explanation, the presence of a local ISM over-density is essential for the detection of radio emission. Alternatively, we consider a non-thermal/synchrotron scenario, evaluating the magnetic field and broad-band spectrum of the shock. However, we find that exceptionally high fractions ($\gtrsim 13$%) of the kinetic wind power would need to be injected into the relativistic electron population to explain the radio emission. Assuming lower fractions implies a hybrid scenario, dominated by free-free radio emission. Finally, we speculate about the detectability of radio bow shocks and whether it requires exceptional ISM or stellar wind properties. Comment: 17 pages including appendices, 12 figures; accepted for publication in MNRAS. Updated version with minor corrections to reference list

  • Publication . Conference object . Article . Preprint . 2022
    Open Access
    Authors: 
    Sun, Yujia; Wang, Shuo; Chen, Chenglizhao; Xiang, Tian-Zhu;
    Publisher: International Joint Conferences on Artificial Intelligence Organization

    Camouflaged object detection (COD), segmenting objects that are elegantly blended into their surroundings, is a valuable yet challenging task. Existing deep-learning methods often fall into the difficulty of accurately identifying the camouflaged object with complete and fine object structure. To this end, in this paper, we propose a novel boundary-guided network (BGNet) for camouflaged object detection. Our method explores valuable and extra object-related edge semantics to guide representation learning of COD, which forces the model to generate features that highlight object structure, thereby promoting camouflaged object detection of accurate boundary localization. Extensive experiments on three challenging benchmark datasets demonstrate that our BGNet significantly outperforms the existing 18 state-of-the-art methods under four widely-used evaluation metrics. Our code is publicly available at: https://github.com/thograce/BGNet. Accepted by IJCAI2022

  • Publication . Preprint . Article . 2022
    Open Access English
    Authors: 
    Lu Yu; Shichao Pei; Lizhong Ding; Jun Zhou; Longfei Li; Chuxu Zhang; Xiangliang Zhang;

    This paper studies learning node representations with graph neural networks (GNNs) for unsupervised scenario. Specifically, we derive a theoretical analysis and provide an empirical demonstration about the non-steady performance of GNNs over different graph datasets, when the supervision signals are not appropriately defined. The performance of GNNs depends on both the node feature smoothness and the locality of graph structure. To smooth the discrepancy of node proximity measured by graph topology and node feature, we proposed SAIL - a novel \underline{S}elf-\underline{A}ugmented graph contrast\underline{i}ve \underline{L}earning framework, with two complementary self-distilling regularization modules, \emph{i.e.}, intra- and inter-graph knowledge distillation. We demonstrate the competitive performance of SAIL on a variety of graph applications. Even with a single GNN layer, SAIL has consistently competitive or even better performance on various benchmark datasets, comparing with state-of-the-art baselines. Accepted by AAAI2022, 10 pages, 3 figures

  • Publication . Article . Preprint . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Benaoum, H. B.; Genly Leon; Ovgun, A.; Quevedo, H.;
    Publisher: arXiv

    We investigate the inflation driven by a non-linear electromagnetic field based on a NLED lagrangian density ${\cal L}_{nled} = - {\cal F} f \left( {\cal F} \right)$, where $f \left( {\cal F}\right)$ is a generalized functional depending on ${\cal F}$. We first formulate an $f$-NLED cosmological model with a more general functional $f \left( {\cal F}\right)$ and show that all NLED models can be expressed in this framework; then, we investigate in details two interesting examples of the functional $f \left( {\cal F}\right)$. We present our phenomenological model based on a new Lagrangian for NLED. Solutions to the field equations with the physical properties of the cosmological parameters are obtained. We show that the early Universe had no Big-Bang singularity, which accelerated in the past. We also investigate the qualitative implications of NLED by studying the inflationary parameters, like the slow-roll parameters, spectral index $n_s$, and tensor-to-scalar ratio $r$ and compare our results with observational data. Detailed phase-space analysis of our NLED cosmological model is performed with and without matter source. As a first approach, we consider the motion of a particle of unit mass in an effective potential. Our systems correspond to fast-slow systems for physical values of the electromagnetic field and the energy densities at the end of inflation. We analyze a complementary system using Hubble-normalized variables to investigate the cosmological evolution previous to the matter-dominated Universe. Comment: 30 pages, 14 figures

  • Publication . Other literature type . Preprint . Article . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Petrini, Leonardo; Cagnetta, Francesco; Vanden-Eijnden, Eric; Wyart, Matthieu;
    Publisher: arXiv
    Country: Switzerland
    Project: NSF | DMS-EPSRC Collaborative R... (2012510), NSF | Statistical and Computati... (2134216), NSF | NYU Materials Research Sc... (1420073)

    It is widely believed that the success of deep networks lies in their ability to learn a meaningful representation of the features of the data. Yet, understanding when and how this feature learning improves performance remains a challenge: for example, it is beneficial for modern architectures trained to classify images, whereas it is detrimental for fully-connected networks trained for the same task on the same data. Here we propose an explanation for this puzzle, by showing that feature learning can perform worse than lazy training (via random feature kernel or the NTK) as the former can lead to a sparser neural representation. Although sparsity is known to be essential for learning anisotropic data, it is detrimental when the target function is constant or smooth along certain directions of input space. We illustrate this phenomenon in two settings: (i) regression of Gaussian random functions on the d-dimensional unit sphere and (ii) classification of benchmark datasets of images. For (i), we compute the scaling of the generalization error with number of training points, and show that methods that do not learn features generalize better, even when the dimension of the input space is large. For (ii), we show empirically that learning features can indeed lead to sparse and thereby less smooth representations of the image predictors. This fact is plausibly responsible for deteriorating the performance, which is known to be correlated with smoothness along diffeomorphisms.

  • Open Access English
    Authors: 
    Deng, Mingkai; Wang, Jianyu; Hsieh, Cheng-Ping; Wang, Yihan; Guo, Han; Shu, Tianmin; Song, Meng; Xing, Eric P.; Hu, Zhiting;
    Project: NSF | Collaborative Research: S... (2123952)

    Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only few downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompt (e.g., embeddings) which falls short of interpretability, reusability across LMs, and applicability when gradients are not accessible. Discrete prompt, on the other hand, is difficult to optimize, and is often created by "enumeration (e.g., paraphrasing)-then-selection" heuristics that do not explore the prompt space systematically. This paper proposes RLPrompt, an efficient discrete prompt optimization approach with reinforcement learning (RL). RLPrompt formulates a parameter-efficient policy network that generates the desired discrete prompt after training with reward. To overcome the complexity and stochasticity of reward signals by the large LM environment, we incorporate effective reward stabilization that substantially enhances the training efficiency. RLPrompt is flexibly applicable to different types of LMs, such as masked (e.g., BERT) and left-to-right models (e.g., GPTs), for both classification and generation tasks. Experiments on few-shot classification and unsupervised text style transfer show superior performance over a wide range of existing finetuning or prompting methods. Interestingly, the resulting optimized prompts are often ungrammatical gibberish text; and surprisingly, those gibberish prompts are transferrable between different LMs to retain significant performance, indicating LM prompting may not follow human language patterns. EMNLP 2022 Camera Ready. Code available at https://github.com/mingkaid/rl-prompt

  • Publication . Article . Preprint . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Mohammad Karimzadeh-Farshbafan; Walid Saad; Merouane Debbah;
    Publisher: arXiv

    Goal-oriented semantic communication will be a pillar of next-generation wireless networks. Despite significant recent efforts in this area, most prior works are focused on specific data types (e.g., image or audio), and they ignore the goal and effectiveness aspects of semantic transmissions. In contrast, in this paper, a holistic goal-oriented semantic communication framework is proposed to enable a speaker and a listener to cooperatively execute a set of sequential tasks in a dynamic environment. A common language based on a hierarchical belief set is proposed to enable semantic communications between speaker and listener. The speaker, acting as an observer of the environment, utilizes the beliefs to transmit an initial description of its observation (called event) to the listener. The listener is then able to infer on the transmitted description and complete it by adding related beliefs to the transmitted beliefs of the speaker. As such, the listener reconstructs the observed event based on the completed description, and it then takes appropriate action in the environment based on the reconstructed event. An optimization problem is defined to determine the perfect and abstract description of the events while minimizing the transmission and inference costs with constraints on the task execution time and belief efficiency. Then, a novel bottom-up curriculum learning (CL) framework based on reinforcement learning is proposed to solve the optimization problem and enable the speaker and listener to gradually identify the structure of the belief set and the perfect and abstract description of the events. Simulation results show that the proposed CL method outperforms traditional RL in terms of convergence time, task execution cost and time, reliability, and belief efficiency.

  • Open Access
    Authors: 
    Filipe Rodrigues; Nicola Ortelli; Michel Bierlaire; Francisco C. Pereira;
    Publisher: Institute of Electrical and Electronics Engineers (IEEE)
    Countries: Switzerland, Denmark

    Specifying utility functions is a key step towards applying the discrete choice framework for understanding the behaviour processes that govern user choices. However, identifying the utility function specifications that best model and explain the observed choices can be a very challenging and time-consuming task. This paper seeks to help modellers by leveraging the Bayesian framework and the concept of automatic relevance determination (ARD), in order to automatically determine an optimal utility function specification from an exponentially large set of possible specifications in a purely data-driven manner. Based on recent advances in approximate Bayesian inference, a doubly stochastic variational inference is developed, which allows the proposed DCM-ARD model to scale to very large and high-dimensional datasets. Using semi-artificial choice data, the proposed approach is shown to very accurately recover the true utility function specifications that govern the observed choices. Moreover, when applied to real choice data, DCM-ARD is shown to be able discover high quality specifications that can outperform previous ones from the literature according to multiple criteria, thereby demonstrating its practical applicability. 21 pages, 2 figures, 11 tables

Advanced search in
Research products
arrow_drop_down
Searching FieldsTerms
Any field
arrow_drop_down
includes
arrow_drop_down
Include:
2,016 Research products, page 1 of 202
  • Open Access English
    Authors: 
    Nadia Figueroa; Haiwei Dong; Abdulmotaleb El Saddik;
    Country: Switzerland

    We propose a 6D RGB-D odometry approach that finds the relative camera pose between consecutive RGB-D frames by keypoint extraction and feature matching both on the RGB and depth image planes. Furthermore, we feed the estimated pose to the highly accurate KinectFusion algorithm, which uses a fast ICP (Iterative Closest Point) to fine-tune the frame-to-frame relative pose and fuse the depth data into a global implicit surface. We evaluate our method on a publicly available RGB-D SLAM benchmark dataset by Sturm et al. The experimental results show that our proposed reconstruction method solely based on visual odometry and KinectFusion outperforms the state-of-the-art RGB-D SLAM system accuracy. Moreover, our algorithm outputs a ready-to-use polygon mesh (highly suitable for creating 3D virtual worlds) without any postprocessing steps.

  • Publication . Preprint . Article . 2022
    Open Access
    Authors: 
    Mingbao Lin; Rongrong Ji; Zihan Xu; Baochang Zhang; Fei Chao; Chia-Wen Lin; Ling Shao;
    Publisher: Institute of Electrical and Electronics Engineers (IEEE)

    Binary neural networks (BNNs) have attracted broad research interest due to their efficient storage and computational ability. Nevertheless, a significant challenge of BNNs lies in handling discrete constraints while ensuring bit entropy maximization, which typically makes their weight optimization very difficult. Existing methods relax the learning using the sign function, which simply encodes positive weights into +1s, and -1s otherwise. Alternatively, we formulate an angle alignment objective to constrain the weight binarization to {0,+1} to solve the challenge. In this paper, we show that our weight binarization provides an analytical solution by encoding high-magnitude weights into +1s, and 0s otherwise. Therefore, a high-quality discrete solution is established in a computationally efficient manner without the sign function. We prove that the learned weights of binarized networks roughly follow a Laplacian distribution that does not allow entropy maximization, and further demonstrate that it can be effectively solved by simply removing the $\ell_2$ regularization during network training. Our method, dubbed sign-to-magnitude network binarization (SiMaN), is evaluated on CIFAR-10 and ImageNet, demonstrating its superiority over the sign-based state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/SiMaN. Comment: Accepted by IEEE TPAMI, 2022

  • Publication . Article . Other literature type . Preprint . 2022 . Embargo End Date: 01 Jan 2021
    Open Access
    Authors: 
    J. van den Eijnden; Ian Heywood; Rob Fender; Shazrene Mohamed; G. R. Sivakoff; Payaswini Saikia; Thomas D. Russell; Sara Motta; James Miller-Jones; Patrick A. Woudt;
    Publisher: arXiv
    Country: United Kingdom
    Project: NSERC

    Vela X-1 is a runaway X-ray binary system hosting a massive donor star, whose strong stellar wind creates a bow shock as it interacts with the interstellar medium. This bow shock has previously been detected in H$\alpha$ and IR, but, similar to all but one bow shock from a massive runaway star (BD+43$^{\rm o}$3654), has escaped detection in other wavebands. We report on the discovery of $1.3$ GHz radio emission from the Vela X-1 bow shock with the MeerKAT telescope. The MeerKAT observations reveal how the radio emission closely traces the H$\alpha$ line emission, both in the bow shock and in the larger-scale diffuse structures known from existing H$\alpha$ surveys. The Vela X-1 bow shock is the first stellar-wind-driven radio bow shock detected around an X-ray binary. In the absence of a radio spectral index measurement, we explore other avenues to constrain the radio emission mechanism. We find that thermal/free-free emission can account for the radio and H$\alpha$ properties, for a combination of electron temperature and density consistent with earlier estimates of ISM density and the shock enhancement. In this explanation, the presence of a local ISM over-density is essential for the detection of radio emission. Alternatively, we consider a non-thermal/synchrotron scenario, evaluating the magnetic field and broad-band spectrum of the shock. However, we find that exceptionally high fractions ($\gtrsim 13$%) of the kinetic wind power would need to be injected into the relativistic electron population to explain the radio emission. Assuming lower fractions implies a hybrid scenario, dominated by free-free radio emission. Finally, we speculate about the detectability of radio bow shocks and whether it requires exceptional ISM or stellar wind properties. Comment: 17 pages including appendices, 12 figures; accepted for publication in MNRAS. Updated version with minor corrections to reference list

  • Publication . Conference object . Article . Preprint . 2022
    Open Access
    Authors: 
    Sun, Yujia; Wang, Shuo; Chen, Chenglizhao; Xiang, Tian-Zhu;
    Publisher: International Joint Conferences on Artificial Intelligence Organization

    Camouflaged object detection (COD), segmenting objects that are elegantly blended into their surroundings, is a valuable yet challenging task. Existing deep-learning methods often fall into the difficulty of accurately identifying the camouflaged object with complete and fine object structure. To this end, in this paper, we propose a novel boundary-guided network (BGNet) for camouflaged object detection. Our method explores valuable and extra object-related edge semantics to guide representation learning of COD, which forces the model to generate features that highlight object structure, thereby promoting camouflaged object detection of accurate boundary localization. Extensive experiments on three challenging benchmark datasets demonstrate that our BGNet significantly outperforms the existing 18 state-of-the-art methods under four widely-used evaluation metrics. Our code is publicly available at: https://github.com/thograce/BGNet. Accepted by IJCAI2022

  • Publication . Preprint . Article . 2022
    Open Access English
    Authors: 
    Lu Yu; Shichao Pei; Lizhong Ding; Jun Zhou; Longfei Li; Chuxu Zhang; Xiangliang Zhang;

    This paper studies learning node representations with graph neural networks (GNNs) for unsupervised scenario. Specifically, we derive a theoretical analysis and provide an empirical demonstration about the non-steady performance of GNNs over different graph datasets, when the supervision signals are not appropriately defined. The performance of GNNs depends on both the node feature smoothness and the locality of graph structure. To smooth the discrepancy of node proximity measured by graph topology and node feature, we proposed SAIL - a novel \underline{S}elf-\underline{A}ugmented graph contrast\underline{i}ve \underline{L}earning framework, with two complementary self-distilling regularization modules, \emph{i.e.}, intra- and inter-graph knowledge distillation. We demonstrate the competitive performance of SAIL on a variety of graph applications. Even with a single GNN layer, SAIL has consistently competitive or even better performance on various benchmark datasets, comparing with state-of-the-art baselines. Accepted by AAAI2022, 10 pages, 3 figures

  • Publication . Article . Preprint . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Benaoum, H. B.; Genly Leon; Ovgun, A.; Quevedo, H.;
    Publisher: arXiv

    We investigate the inflation driven by a non-linear electromagnetic field based on a NLED lagrangian density ${\cal L}_{nled} = - {\cal F} f \left( {\cal F} \right)$, where $f \left( {\cal F}\right)$ is a generalized functional depending on ${\cal F}$. We first formulate an $f$-NLED cosmological model with a more general functional $f \left( {\cal F}\right)$ and show that all NLED models can be expressed in this framework; then, we investigate in details two interesting examples of the functional $f \left( {\cal F}\right)$. We present our phenomenological model based on a new Lagrangian for NLED. Solutions to the field equations with the physical properties of the cosmological parameters are obtained. We show that the early Universe had no Big-Bang singularity, which accelerated in the past. We also investigate the qualitative implications of NLED by studying the inflationary parameters, like the slow-roll parameters, spectral index $n_s$, and tensor-to-scalar ratio $r$ and compare our results with observational data. Detailed phase-space analysis of our NLED cosmological model is performed with and without matter source. As a first approach, we consider the motion of a particle of unit mass in an effective potential. Our systems correspond to fast-slow systems for physical values of the electromagnetic field and the energy densities at the end of inflation. We analyze a complementary system using Hubble-normalized variables to investigate the cosmological evolution previous to the matter-dominated Universe. Comment: 30 pages, 14 figures

  • Publication . Other literature type . Preprint . Article . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Petrini, Leonardo; Cagnetta, Francesco; Vanden-Eijnden, Eric; Wyart, Matthieu;
    Publisher: arXiv
    Country: Switzerland
    Project: NSF | DMS-EPSRC Collaborative R... (2012510), NSF | Statistical and Computati... (2134216), NSF | NYU Materials Research Sc... (1420073)

    It is widely believed that the success of deep networks lies in their ability to learn a meaningful representation of the features of the data. Yet, understanding when and how this feature learning improves performance remains a challenge: for example, it is beneficial for modern architectures trained to classify images, whereas it is detrimental for fully-connected networks trained for the same task on the same data. Here we propose an explanation for this puzzle, by showing that feature learning can perform worse than lazy training (via random feature kernel or the NTK) as the former can lead to a sparser neural representation. Although sparsity is known to be essential for learning anisotropic data, it is detrimental when the target function is constant or smooth along certain directions of input space. We illustrate this phenomenon in two settings: (i) regression of Gaussian random functions on the d-dimensional unit sphere and (ii) classification of benchmark datasets of images. For (i), we compute the scaling of the generalization error with number of training points, and show that methods that do not learn features generalize better, even when the dimension of the input space is large. For (ii), we show empirically that learning features can indeed lead to sparse and thereby less smooth representations of the image predictors. This fact is plausibly responsible for deteriorating the performance, which is known to be correlated with smoothness along diffeomorphisms.

  • Open Access English
    Authors: 
    Deng, Mingkai; Wang, Jianyu; Hsieh, Cheng-Ping; Wang, Yihan; Guo, Han; Shu, Tianmin; Song, Meng; Xing, Eric P.; Hu, Zhiting;
    Project: NSF | Collaborative Research: S... (2123952)

    Prompting has shown impressive success in enabling large pretrained language models (LMs) to perform diverse NLP tasks, especially when only few downstream data are available. Automatically finding the optimal prompt for each task, however, is challenging. Most existing work resorts to tuning soft prompt (e.g., embeddings) which falls short of interpretability, reusability across LMs, and applicability when gradients are not accessible. Discrete prompt, on the other hand, is difficult to optimize, and is often created by "enumeration (e.g., paraphrasing)-then-selection" heuristics that do not explore the prompt space systematically. This paper proposes RLPrompt, an efficient discrete prompt optimization approach with reinforcement learning (RL). RLPrompt formulates a parameter-efficient policy network that generates the desired discrete prompt after training with reward. To overcome the complexity and stochasticity of reward signals by the large LM environment, we incorporate effective reward stabilization that substantially enhances the training efficiency. RLPrompt is flexibly applicable to different types of LMs, such as masked (e.g., BERT) and left-to-right models (e.g., GPTs), for both classification and generation tasks. Experiments on few-shot classification and unsupervised text style transfer show superior performance over a wide range of existing finetuning or prompting methods. Interestingly, the resulting optimized prompts are often ungrammatical gibberish text; and surprisingly, those gibberish prompts are transferrable between different LMs to retain significant performance, indicating LM prompting may not follow human language patterns. EMNLP 2022 Camera Ready. Code available at https://github.com/mingkaid/rl-prompt

  • Publication . Article . Preprint . 2022 . Embargo End Date: 01 Jan 2022
    Open Access
    Authors: 
    Mohammad Karimzadeh-Farshbafan; Walid Saad; Merouane Debbah;
    Publisher: arXiv

    Goal-oriented semantic communication will be a pillar of next-generation wireless networks. Despite significant recent efforts in this area, most prior works are focused on specific data types (e.g., image or audio), and they ignore the goal and effectiveness aspects of semantic transmissions. In contrast, in this paper, a holistic goal-oriented semantic communication framework is proposed to enable a speaker and a listener to cooperatively execute a set of sequential tasks in a dynamic environment. A common language based on a hierarchical belief set is proposed to enable semantic communications between speaker and listener. The speaker, acting as an observer of the environment, utilizes the beliefs to transmit an initial description of its observation (called event) to the listener. The listener is then able to infer on the transmitted description and complete it by adding related beliefs to the transmitted beliefs of the speaker. As such, the listener reconstructs the observed event based on the completed description, and it then takes appropriate action in the environment based on the reconstructed event. An optimization problem is defined to determine the perfect and abstract description of the events while minimizing the transmission and inference costs with constraints on the task execution time and belief efficiency. Then, a novel bottom-up curriculum learning (CL) framework based on reinforcement learning is proposed to solve the optimization problem and enable the speaker and listener to gradually identify the structure of the belief set and the perfect and abstract description of the events. Simulation results show that the proposed CL method outperforms traditional RL in terms of convergence time, task execution cost and time, reliability, and belief efficiency.

  • Open Access
    Authors: 
    Filipe Rodrigues; Nicola Ortelli; Michel Bierlaire; Francisco C. Pereira;
    Publisher: Institute of Electrical and Electronics Engineers (IEEE)
    Countries: Switzerland, Denmark

    Specifying utility functions is a key step towards applying the discrete choice framework for understanding the behaviour processes that govern user choices. However, identifying the utility function specifications that best model and explain the observed choices can be a very challenging and time-consuming task. This paper seeks to help modellers by leveraging the Bayesian framework and the concept of automatic relevance determination (ARD), in order to automatically determine an optimal utility function specification from an exponentially large set of possible specifications in a purely data-driven manner. Based on recent advances in approximate Bayesian inference, a doubly stochastic variational inference is developed, which allows the proposed DCM-ARD model to scale to very large and high-dimensional datasets. Using semi-artificial choice data, the proposed approach is shown to very accurately recover the true utility function specifications that govern the observed choices. Moreover, when applied to real choice data, DCM-ARD is shown to be able discover high quality specifications that can outperform previous ones from the literature according to multiple criteria, thereby demonstrating its practical applicability. 21 pages, 2 figures, 11 tables

Send a message
How can we help?
We usually respond in a few hours.