🎬Videokit Tools
The Videokit is a collection of glifs that allows you to create and edit different kinds of videos.
These glifs were designed to power-up your ability to create short-form videos.
Below you will find workflows that are ready for Glifchat use. This means that the glifs have all their input descriptions filled in so that your bot knows how to use them as skills and create videos end-to-end.
Under the hood, the glifs are communicating with a custom backend. At the moment, you need a special API key in order to run the functions. If you're still curious about all of the capabilities, you can check the swagger docs here.
Below are some pre-built glifs, ranging from high-level to low-level functions.
We convert everything to mp4
format. If your input videos have a different encoding, rendering could take a longer time, and often result in a error. For best results, pass urls of mp4
videos.
High-level glifs
🎬 Create captioned tiktok video (single)
Creates a TikTok video given a video url and a voice url. Text will be overlayed on top of the video at the selected location (top, center, bottom). If the video is shorter than the voice clip, then the video will loop. If the video is longer than the voice clip, then it will shorten the video length to match.
video_url: url to the video
voice_url: url to the voice that will be transcribed
text_position: where to put the text
Example:
🎬 Create tiktok video (split screen)
Creates a tiktok video given a video url, a voice url and an extra video url to put below the first video. Text will be shown on top of the video at the selected location (top, center, or bottom). If the video is shorter than the voice clip, then the video will loop. If the video is longer than the voice clip, then it will shorten the video length to match.
video_url: url to the video
voice_url: url to the voice that will be transcribed
text_position: where to put the text
bottom_video_url: url to the video to show at the bottom
Example:
🎬 Create tiktok video (fast)
Creates a tiktok video given image urls and and a voice url. Captions will be overlayed on top of the images at the selected location (center, top, bottom). You do not need to worry about setting the duration of the images for the video to match the audio length, we do that all magically for you 🎉. OPTIONAL: You can also overlay audio by passing a audio file mp4!
image_urls: list of image urls
voice_url: url to the voice that will be transcribed
song_url: optional background music url
text_position: where to put the text
highlight_color: color of the text highlighting
add_watermark: add Glif watermark in corner
Example:
Medium-level tools
🎬 Create video from asset links
Create a video from a combination of images, videos and/or audio. Clip AND loop options are mutually exclusive. Stay around 480p or less. Animated gifs need to be added to video urls, not image urls. Bools should be "true" or "false". If no audio is given, it will use the audio from video clips if possible.
image_duration: duration of each image in seconds
image_urls: list of image urls
video_urls: list of video urls
audio_url: url to the voice that will be transcribed
height: height of the video
width: width of the video
clip_length_to_audio: clip video to audio
clip_length_to_video: clip audio to video
loop_audio: loop audio until length of video
loop_video: loop video until length of audio
🎬 Create video from video/images
Create a video from a combination of images, videos and/or audio, and keeps the respective order. Clip options are mutually exclusive. Stay around 480p or less. Bools should be "true" or "false". If no audio is given, it will use the audio from video clips if possible.
asset_urls: list of urls to images and videos
image_duration: duration of each image in seconds
audio_url: url to the voice that will be transcribed
height: height of the video
width: width of the video
clip_length_to_audio: clip video to audio
clip_length_to_video: clip audio to video
Low-level Tools
🎬 Analyze media asset
Returns detailed information about an image, video or audio asset.
asset_url: url to image, video or audio
Example:
{
"asset_url_or_path": "https://res.cloudinary.com/dkpfhyd71-comfy/video/upload/v1747423113/glif_worker_general/2c3e4cdf-f066-48ba-aa18-d54915a6d1b5.mp3",
"format":"mp3",
"size":87597,
"type":"audio",
"duration":5.448,
"bit_rate":128629,
"audio_codec":"mp3",
"sample_rate":24000,
"channels":1,
"channel_layout":"mono",
"audio_bit_rate":128000
}
🎬 Create speech from text
Creates an audio clip from given text using speech-kokoro
with Adam - American Male
. Returns a url to audio clip.
text: text to be transcribed
Example:
🎬 Put text on image
This tool puts custom text on top of an image an returns it in aspect ratio 9:16. Example:

🎬 Combine videos
Combine two videos either horizontally or vertically.
video_1_url: url to left/up video
video_2_url: url to right/bottom video
use_audio_from_first_video: use audio for first or not
layout: horizontal or vertical
loop_to_longest: loop to the longest video
height: final height
width: final width
fit_mode: pad or cover
Examples:
🎬 Add subtitles to video
Adds subtitles based on a voice clip. It transcribes the audio clip with time markings so that it can show the text in a similar rhythm as the speech.
video_url: url to video to show
voice_url: url to voice clip
text_position: position of text
add_voice_audio: also add the voice audio
highlight_color: color of the text highlighting
Example:
🎬 Mix audio to video
This tool overlays audio onto an existing video. The audio will be snipped to the video length.
video_url: url to video to show
audio_url: url to the audio
mix_factor_video: True
mix_factor_audio: True
Example:
🎬 Display video from url
This tool puts a video html object into an artifact. Mostly used for glifchat artifacts.
video_url: url to video to show
Last updated