Google has set up a new milestone for speech generation: "Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model"
You can listen to generated samples at: https://google.github.io/tacotron/
Paper: https://arxiv.org/abs/1703.10135
#audio #arxiv #google #breakthrough #generative
You can listen to generated samples at: https://google.github.io/tacotron/
Paper: https://arxiv.org/abs/1703.10135
#audio #arxiv #google #breakthrough #generative
arXiv.org
Tacotron: Towards End-to-End Speech Synthesis
A text-to-speech synthesis system typically consists of multiple stages, such as a text analysis frontend, an acoustic model and an audio synthesis module. Building these components often requires...
Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One
Classifiers are secretly energy-based models! Every softmax giving
The authors did math tricks to the joint Energy-Based Model (EBM) to a usual #classifier with #softmax. So it turns that in this usual classifier hiding the EBM.
Our key observation in this work is that one can slightly re-interpret the logits obtained from
In their result, they show that approach gives more adversarial robustness and for generated images maximize class confidence than its classified. It also gives more unify the density of confidence classes of all data. And of course, it can generate pictures. Interesting result by math trick!
paper: http://arxiv.org/abs/1912.03263
tweet: http://twitter.com/DavidDuvenaud/status/1204143678865866752
github: https://github.com/wgrathwohl/JEM
Classifiers are secretly energy-based models! Every softmax giving
p(c|x)
has an unused degree of freedom, which we use to compute the input density p(x).
This makes classifiers into #generative models without changing the architecture.The authors did math tricks to the joint Energy-Based Model (EBM) to a usual #classifier with #softmax. So it turns that in this usual classifier hiding the EBM.
Our key observation in this work is that one can slightly re-interpret the logits obtained from
fθ
to define p(x, y)
and p(x)
as well. Without changing fθ,
one can re-use the logits to define an energy-based model of the joint distribution of data point x
and labels y.
The normalizing constant cancels out, yielding the standard Softmax parameterization. Thus, we have found a generative model hidden within every standard discriminative model!In their result, they show that approach gives more adversarial robustness and for generated images maximize class confidence than its classified. It also gives more unify the density of confidence classes of all data. And of course, it can generate pictures. Interesting result by math trick!
paper: http://arxiv.org/abs/1912.03263
tweet: http://twitter.com/DavidDuvenaud/status/1204143678865866752
github: https://github.com/wgrathwohl/JEM
🔥Logo generation autonomous system was revealed to be used in production for almost a year.
Leading Russia-based design studio Artlebedev revealed that they experimented with using neural networks and set of algorithmic systems to design logotypes for real customers. They named system Nikolay Ironov (in russian N.Ironov sounds close to Neuronov). The system realeased 17 commercial projects, which were welcomed by the audience.
Mishief managed! 😈
Link: https://www.artlebedev.com/ironov/
Project portfolio: https://www.artlebedev.ru/nikolay-ironov/
#GAN #design #logotypes #logo #generation #generative #artlebedev
Leading Russia-based design studio Artlebedev revealed that they experimented with using neural networks and set of algorithmic systems to design logotypes for real customers. They named system Nikolay Ironov (in russian N.Ironov sounds close to Neuronov). The system realeased 17 commercial projects, which were welcomed by the audience.
Mishief managed! 😈
Link: https://www.artlebedev.com/ironov/
Project portfolio: https://www.artlebedev.ru/nikolay-ironov/
#GAN #design #logotypes #logo #generation #generative #artlebedev
Generative Agents: Interactive Simulacra of Human Behavior
Imagine a world where computational software agents can simulate believable human behavior, empowering a wide range of interactive applications from immersive environments to rehearsal spaces for interpersonal communication and prototyping tools. This paper introduces "generative agents," a groundbreaking concept where agents perform daily routines, engage in creative activities, form opinions, interact with others, and remember and reflect on their experiences as they plan their next day.
To bring generative agents to life, the authors propose an innovative architecture that extends a large language model, allowing agents to store and reflect on their experiences using natural language and dynamically plan their behavior. They showcase the potential of generative agents in an interactive sandbox environment inspired by The Sims, where users can engage with a small town of 25 agents using natural language. The evaluation highlights the agents' ability to autonomously create and navigate complex social situations, producing believable individual and emergent social behaviors. This groundbreaking work demonstrates the critical contributions of observation, planning, and reflection components in agent architecture, laying the foundation for more realistic simulations of human behavior and unlocking exciting possibilities across various applications.
Paper link: https://arxiv.org/abs/2304.03442
Demo link: https://reverie.herokuapp.com/arXiv_Demo/#
A detailed unofficial overview of the paper: https://andlukyane.com/blog/paper-review-ishb
#deeplearning #nlp #generative # simulation
Imagine a world where computational software agents can simulate believable human behavior, empowering a wide range of interactive applications from immersive environments to rehearsal spaces for interpersonal communication and prototyping tools. This paper introduces "generative agents," a groundbreaking concept where agents perform daily routines, engage in creative activities, form opinions, interact with others, and remember and reflect on their experiences as they plan their next day.
To bring generative agents to life, the authors propose an innovative architecture that extends a large language model, allowing agents to store and reflect on their experiences using natural language and dynamically plan their behavior. They showcase the potential of generative agents in an interactive sandbox environment inspired by The Sims, where users can engage with a small town of 25 agents using natural language. The evaluation highlights the agents' ability to autonomously create and navigate complex social situations, producing believable individual and emergent social behaviors. This groundbreaking work demonstrates the critical contributions of observation, planning, and reflection components in agent architecture, laying the foundation for more realistic simulations of human behavior and unlocking exciting possibilities across various applications.
Paper link: https://arxiv.org/abs/2304.03442
Demo link: https://reverie.herokuapp.com/arXiv_Demo/#
A detailed unofficial overview of the paper: https://andlukyane.com/blog/paper-review-ishb
#deeplearning #nlp #generative # simulation