Find answers to the most common questions about our products.
D-ID’s Creative Reality™ Studio is a self-service platform featuring the best generative AI tools to enable users to create videos with moving and talking avatars. Combining D-ID’s deep-learning face animation technology with LLM text generation and text-to-image capabilities, the Creative Reality™ Studio is an all-in-one platform for those seeking to create cutting-edge videos with the power of artificial intelligence.
The Creative Reality™ Studio is available on desktop and mobile.
The Creative Reality™ Studio was developed for businesses and individual content creators who want to use avatars to create AI videos featuring digital humans for a wide range of commercial and creative purposes.
All videos are generated in MP4 format.
When using the D-ID Creative Reality Studio or the D-ID API, video length is limited to 5 minutes.
There are three ways to animate faces on the Creative Reality™ Studio
Image prompting is a mix of art and science. Our image-generating software is optimized to produce faces that can be animated in the studio, but there is a lot of room for creativity. To get started, we suggest you select one of the pre-created prompts and try out variations of those. Alternatively, try searching for prompts and inspiration on one of numerous prompt-building platforms available online.
There are three ways to add voice to your video
Our studio lets you add various visual elements, such as backgrounds, videos, and text, to your designs. Each element is set as its own layer, and you can determine the order by using the “position” button on the top right side of the screen.
The Canvas Layout function lets you choose the layout that best fits your design, so you can create videos for mobile, social media, presentations, and more. The canvas can be set to wide, square, or vertical. To switch from one canvas to another, click on the canvas and select one of the available options in the top left corner of the design window.
Positioning determines where on the canvas you want the presenter to be.
Transparency lets you choose how opaque you want each element to be.
* Layers are not available for mobile app users.
You can determine whether your avatar will look happy, serious, surprised, or maintain a neutral expression. Click on the video field in the design window and select which emotion you want your presenter to convey. The chosen facial expression will be implemented for the duration of the video.
* Expressions are not available for mobile app users.
Go to the video library and click on your user profile at the bottom left of the screen. Then click on “Switch back to classic editor.”
Pro and API Launch plans receive 1 cloned voice; Advanced and API Scale plans receive 3 voices. The number of voices in Enterprise plans is customizable.
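The allowances above can be written as a small lookup. This is only an illustrative sketch: the lowercase plan keys, the function name, and the choice to treat unlisted plans as having no cloned voices are assumptions, not part of any D-ID API.

```python
from typing import Optional

# Cloned-voice allowances as listed in this FAQ. None marks the Enterprise
# allowance, which is customizable and therefore not a fixed number.
# The lowercase plan keys are an assumed naming scheme for this sketch.
CLONED_VOICE_ALLOWANCE = {
    "pro": 1,
    "api launch": 1,
    "advanced": 3,
    "api scale": 3,
    "enterprise": None,
}


def remaining_cloned_voices(plan: str, voices_in_use: int) -> Optional[int]:
    """Return how many more voices can be cloned, or None if the plan is customizable."""
    key = plan.strip().lower()
    if key not in CLONED_VOICE_ALLOWANCE:
        # Assumption: plans not named in the FAQ include no cloned voices.
        return 0
    allowance = CLONED_VOICE_ALLOWANCE[key]
    if allowance is None:
        return None
    return max(0, allowance - voices_in_use)
```

For example, `remaining_cloned_voices("Advanced", 1)` gives 2, while an Enterprise plan returns `None` because its allowance is agreed per customer.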
Users can now upload an audio file or record directly from the Studio to create an Instant Cloned Voice. They can also delete a created cloned voice. Users must record a voice consent as part of the audio file they submit (both in the Studio and API).
When using the Creative Reality Studio or the D-ID API, audio size is limited to 10MB and up to 5 minutes.
Supported audio formats – MP3, FLAC, M4A, MP4, WAV
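If you prepare audio files in bulk, it can help to check them against these limits before uploading. The sketch below is a local pre-check only; the function name is hypothetical, and counting 1 MB as 1024 × 1024 bytes is an assumption, since the exact byte threshold is not stated here.

```python
from pathlib import Path

# Limits quoted in this FAQ. Treating 1 MB as 1024 * 1024 bytes is an assumption.
MAX_AUDIO_BYTES = 10 * 1024 * 1024
MAX_AUDIO_SECONDS = 5 * 60
SUPPORTED_AUDIO_EXTENSIONS = {".mp3", ".flac", ".m4a", ".mp4", ".wav"}


def audio_problems(filename: str, size_bytes: int, duration_seconds: float) -> list[str]:
    """Return a list of reasons the file would likely be rejected (empty if none)."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in SUPPORTED_AUDIO_EXTENSIONS:
        problems.append(f"unsupported format: {extension or 'no extension'}")
    if size_bytes > MAX_AUDIO_BYTES:
        problems.append("file is larger than 10MB")
    if duration_seconds > MAX_AUDIO_SECONDS:
        problems.append("audio is longer than 5 minutes")
    return problems
```

The check looks only at the file extension, not the actual encoding, so a mislabeled file would still pass it.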
The studio currently supports 119 languages, along with a variety of accents and speaking styles.
You can add breaks in your script by clicking on the stopwatch icon on the bottom of the text box. Each break is 0.5 seconds long.
It is important for us, as a company that enables users to create AI-based content, that there is transparency about the synthetic nature of the videos they generate. This is also reflected in our ethical manifesto, available at https://www.d-id.com/ethics , and applicable terms of use.
Depends on your plan:
Please follow our image guidelines:
– Facing camera, medium shot
– Neutral expression, closed mouth
– Minimum head size 200×200 pixels
– Good and consistent lighting
– Up to 10MB
– No face occlusions (hats, sunglasses, masks, visors, large earrings)
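The two measurable guidelines in this list (head size and file size) can be checked automatically; the others need a human eye or a face-analysis tool. The following is a minimal sketch with a hypothetical function name. It assumes you already know the head's bounding box in pixels from your own tooling, and that 1 MB means 1024 × 1024 bytes.

```python
# Measurable limits from the image guidelines above.
# Treating 1 MB as 1024 * 1024 bytes is an assumption.
MAX_IMAGE_BYTES = 10 * 1024 * 1024
MIN_HEAD_PIXELS = 200


def image_problems(size_bytes: int, head_width_px: int, head_height_px: int) -> list[str]:
    """Check only the numeric guidelines; pose, lighting and occlusions are not covered."""
    problems = []
    if size_bytes > MAX_IMAGE_BYTES:
        problems.append("image is larger than 10MB")
    if head_width_px < MIN_HEAD_PIXELS or head_height_px < MIN_HEAD_PIXELS:
        problems.append("head is smaller than 200x200 pixels")
    return problems
```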
There are two possible reasons:
A. The image you are trying to use failed to pass our built-in moderation process. Moderation is carried out by a third-party tool, and bypassing it is only allowed for Advanced and Enterprise customers, provided they use their own moderation solution.
Advanced plan users have the option to request a manual review.
B. Our system did not detect a face in the provided image. This may happen when trying to animate animals, cartoons, or anime figures.
This probably happened because our built-in moderation detected a violation and has therefore blocked the video from being generated. To overcome this, please remove the problematic content and try again.
Each credit is worth up to 15 seconds of video. When generating longer videos, credits add up according to the length of the generated video. For example, a 40-second video consumes 3 credits.
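The credit count is the video length divided by 15 seconds, rounded up. As a rough sketch (the function name is hypothetical and this is not an official calculator):

```python
import math

SECONDS_PER_CREDIT = 15  # each credit covers up to 15 seconds of video


def credits_for_video(duration_seconds: float) -> int:
    """Round the video length up to the next whole 15-second block."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    return math.ceil(duration_seconds / SECONDS_PER_CREDIT)
```

With this rule, a 40-second video needs `credits_for_video(40)`, which is 3 credits, matching the example above, while a 15-second video needs 1.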
For streaming customers using our API, the price of credits is halved.
Visit https://www.d-id.com/pricing/api/ for more details.
Each plan (Lite, Pro, Advanced) offers three packages with different amounts of credits. If you use up your credits before the end of the month, you can switch to a bigger package that includes more credits.
* Bigger packages are not available for mobile app users.
Credits do not accumulate. They are renewed every month, and unused credits become void.
If you wish to upgrade your plan, you can do that via the Pricing page.
For details regarding the Enterprise plan please contact our sales team.
You can cancel your monthly subscription at any time on the “Account & API” page. To access the page, click the menu on the bottom left and press “Account & API” > Plan and billing > Cancel Plan.
Mobile users can unsubscribe through their store settings.
If you unsubscribe, your videos will still be accessible when you log in to the studio. Your remaining credits will remain valid until the end of the current billing period.
To delete your account, please contact our support team at support@d-id.com.
Mobile studio users can delete their accounts on the “account settings” page.
Please go to the Account page in the studio, and generate your API key. Note that it is mandatory to have valid credits in your account to use the API.
Credits used for the API are taken from the same balance as the studio.
API documentation is available at the Developer Hub.
Agents are autonomous AI assistants that can answer questions based on the knowledge uploaded by their owner, and perform a specific role or task that’s helpful for business or individual use cases.
Anyone can create an agent, without any knowledge of coding. Creating an agent is as easy as selecting a role, giving the agent instructions and uploading knowledge documents. Users need to sign up or be logged into their D-ID Studio account to create an agent.
Agents are excellent for roles in marketing, customer engagement, education, and training. Agents can simulate real people and fictional characters, or they can be virtual influencers that represent famous brands or individuals.
Agents can help companies boost sales, answer their customers’ questions or chat with their followers. Each agent is an expert in a different area, with access to a specific knowledge base. You can talk with an agent to find out exactly who they are and what their role is.
You can talk with Agents by typing in your question in the text input box, or by clicking the microphone icon and talking with the Agent just like you would talk with another person (available on Chrome/Safari browsers or most mobile devices).
Yes, agents support many major languages, such as Hindi, Spanish, French, German, and Portuguese. Just start talking with an agent in another language, and it will reply in that language if it has a multilingual voice enabled.
You can use standard voices, as well as high quality (Pro) voices from ElevenLabs, which are identified by the Pro icon in the Voices selection menu. You can also select a number of native voices for other languages, as well as multilingual voices that can speak several languages. You can also clone your own voice by uploading an audio recording.
Certainly, you can have many other people talk with your agent. You can either share a link to your agent, hosted by D-ID, or you can embed an agent on your own website. Keep in mind that when you share an agent with other users, their conversations with your agent will be charged against your account.
Agents use natural language processing and generative AI to understand your text or voice input and then provide relevant responses. They use RAG technology to retrieve accurate answers to queries from a knowledge base of uploaded documents.
The documents that you upload will provide a knowledge base for your Agent to draw from that is not available to the LLM used by the agent. For example, your documents may have proprietary or non-public information.
Your documents can be PDF, TXT, or PPTX (PowerPoint) files that add to the expertise of your Agent. Website URLs are also supported, so you can upload the text content from a web page. For optimal results, you should upload documents that contain paragraphs of text, in the style of an article or FAQ document.
You can upload up to 5 documents, and each document can have a maximum of 500,000 text characters.
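Before uploading, you can check a set of documents against these two limits. This is a minimal sketch with a hypothetical function name; it counts characters with Python's `len`, which may not match exactly how the platform counts them.

```python
MAX_DOCUMENTS = 5
MAX_CHARS_PER_DOCUMENT = 500_000


def knowledge_base_problems(documents: dict[str, str]) -> list[str]:
    """documents maps a file name to its extracted text; returns any limit violations."""
    problems = []
    if len(documents) > MAX_DOCUMENTS:
        problems.append(f"{len(documents)} documents exceeds the limit of {MAX_DOCUMENTS}")
    for name, text in documents.items():
        # len() counts Unicode code points; the platform's own count may differ.
        if len(text) > MAX_CHARS_PER_DOCUMENT:
            problems.append(f"{name} has more than {MAX_CHARS_PER_DOCUMENT} characters")
    return problems
```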
Your documents can only be accessed by you and your agents. If you share your agent with other users, then they can also learn about the content of your documents by talking with the agent. For more detailed information, please read our privacy policy.
Yes, you can edit the agent details and text settings and update the knowledge base of your agent.
D-ID is offering everyone 200 free conversation sessions that you can use to get started. After that, the number of conversation sessions depends on the price plan you have selected.
Users can receive up to 5 messages from an agent in each conversation session. The sixth message onwards counts as a new session in your price plan.
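One way to estimate session usage is to count every block of up to 5 agent messages in a conversation as one session. The sketch below assumes that reading of the rule (the 6th to 10th messages form a second session, the 11th starts a third, and so on); the function name is hypothetical, and actual billing may differ.

```python
import math

MESSAGES_PER_SESSION = 5  # agent messages included in one conversation session


def sessions_used(agent_messages: int) -> int:
    """Estimate sessions consumed, assuming each block of 5 agent messages is one session."""
    if agent_messages < 0:
        raise ValueError("message count cannot be negative")
    return math.ceil(agent_messages / MESSAGES_PER_SESSION)
```

Under this assumption, a conversation with 5 agent messages uses 1 session and a conversation with 6 uses 2.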
You can start on a free trial plan to try out Agents, and then select a price plan that suits you from the D-ID pricing page.
Yes, an API is available to everyone who has a D-ID Studio account, and the corresponding price plans are on the D-ID pricing page.
D-ID Video Translate allows effortless production of multilingual content, without the hassle and expense of traditional video production. Users upload a video in one language and receive translated versions in other languages, with voice cloning and accurate lip-syncing to match.
After uploading a video, our AI-powered system processes the content, translates the spoken words into the selected language, and then re-synchronizes the lip movements to match the new audio generated with the original voice, ensuring the video looks as natural as possible in the new language.
Read more about Video Translate here
For optimal results, the video should feature only one person in the frame, with the speaker facing forward and their face clearly visible at all times. To maintain clear audio, it’s important to minimize background noise and music.
Translation times vary depending on the length of the video. Typically, the process takes anywhere from a few minutes to an hour for longer videos. An email will be sent once your translation is ready.
Once the video is translated, you can download it and make further edits using your preferred video editing software.
Yes. The video should be between 10 seconds and 5 minutes in length, and the file size should not exceed 2GB.
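These limits are easy to check locally before uploading. The following is a hedged sketch: the function name is hypothetical, and treating 2GB as 2 × 1024³ bytes is an assumption, since the exact byte threshold is not stated here.

```python
MIN_VIDEO_SECONDS = 10
MAX_VIDEO_SECONDS = 5 * 60
# Assumption: 1 GB is counted as 1024 ** 3 bytes.
MAX_VIDEO_BYTES = 2 * 1024 ** 3


def translate_upload_problems(duration_seconds: float, size_bytes: int) -> list[str]:
    """Return the reasons a video falls outside the Video Translate limits."""
    problems = []
    if duration_seconds < MIN_VIDEO_SECONDS:
        problems.append("video is shorter than 10 seconds")
    if duration_seconds > MAX_VIDEO_SECONDS:
        problems.append("video is longer than 5 minutes")
    if size_bytes > MAX_VIDEO_BYTES:
        problems.append("file is larger than 2GB")
    return problems
```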
For trial users, credits used for Video Translation are taken from the same balance as the Studio. Each credit is worth up to 15 seconds of translated video, the same as Studio credits. For example, translating a 15-second video into two languages will cost 2 credits.
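Translation credits follow the same 15-second rounding, applied once per target language. A small sketch of the arithmetic (the function name is hypothetical and this is not an official calculator):

```python
import math

SECONDS_PER_CREDIT = 15  # each credit covers up to 15 seconds of translated video


def translation_credits(duration_seconds: float, target_languages: int) -> int:
    """Credits per language (length rounded up to 15-second blocks) times the language count."""
    if duration_seconds <= 0 or target_languages < 1:
        raise ValueError("need a positive duration and at least one target language")
    return math.ceil(duration_seconds / SECONDS_PER_CREDIT) * target_languages
```

Here `translation_credits(15, 2)` gives 2, matching the example above, and a 40-second video translated into three languages would come to 9 credits.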
Read more about the pricing here.
All data communications to and from our services are SSL/TLS encrypted (TLS 1.3 protocol). Data at rest is stored in S3 storage with Transparent Data Encryption. Servers and storage are protected by a firewall and WAF. Transient and temporary information is additionally erased automatically after 24 hours, or by the customer using the API delete endpoint. All our workstations are encrypted and further protected by antivirus software with Endpoint Detection & Response (EDR).
ISO certificates: 27018:2019, 27017:2015, 27001:2013
In case of a data or security breach, we may, at our discretion and as required by the applicable law and regulations, update a user about the relevant details of such an event.
Yes. The servers and relevant data are mirrored and/or backed up in real time as part of the AWS platform service.
Access Permissions are handled by D-ID. There is a complete separation between our development and testing environments and the production environment. Only certified personnel may access the production environment. Further, based on their credentials, users may only access their own data. Credentials are revoked on a regular basis when/if needed.