Training and running an AI model from scratch can be a tedious process: it involves curating large amounts of quality data and working with complex, computationally expensive mathematical algorithms. It often makes more sense to use the well-trained and well-tested models already available through the ChatGPT APIs. Integrating such AI capabilities can significantly enhance user experience and speed up many online processes.
The ChatGPT API is a powerful tool that leverages artificial intelligence to generate human-like text based on given prompts. This API has found applications in various fields, including customer service, content creation, coding, and more. Understanding the key parameters of the ChatGPT API and their mathematical significance can greatly enhance the quality and relevance of the generated text. Let us delve deeper into some of the important parameters that help us control the responses from the OpenAI models.
🔸 This is the input list of messages to which the model responds.
🔸 The prompt sets the context and specifies the task for the ChatGPT API. It can be seen as the initial condition for the generative process, where the model applies conditional probability to generate the next sequence of words.
🔸 Its value is a list of message objects, each describing a single conversational turn with a “role” (system, user, or assistant) and its “content”.
🔸 It’s important to provide clear and specific instructions or content that guides the ChatGPT API’s behavior and helps it generate relevant and accurate responses. Experimenting with different types of content and instructions can help achieve the desired results and tailor the model’s responses to our specific needs and preferences.
🔸 In the case of chat applications, to ensure contextual awareness in the response of the model, it is necessary to also include the conversation history of a chat session.
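As a minimal sketch, a chat request’s “messages” list carries the system instruction plus the replayed history of earlier turns; the conversation text here is invented for illustration:

```python
# The role/content structure follows the OpenAI Chat Completions format.
# Prior turns are replayed so the model can resolve references like
# "there" in the latest question.
messages = [
    {"role": "system", "content": "You are a concise travel assistant."},
    {"role": "user", "content": "What is the capital of France?"},
    {"role": "assistant", "content": "The capital of France is Paris."},
    {"role": "user", "content": "What is the weather like there in spring?"},
]
```

Without the second and third entries, the final question would be ambiguous to the model.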
🔸 Tokenization is one of the primitive steps in any NLP (Natural Language Processing) pipeline where an input text is broken down into smaller units before it gets processed by a language model.
🔸 A token is a group of typically 1 to 4 characters that do not necessarily form a meaningful word and are derived from common character combinations that are observed across languages.
🔸 The model’s tokenizer maintains a vast vocabulary of distinct tokens, each assigned a unique identifier (token ID) based on its frequency in the training data. The higher the frequency of a token in the training data, the lower its ID value tends to be.
🔸 Check out OpenAI’s tokenizer tool to understand how a piece of text might be tokenized using different algorithms.
🔸 OpenAI charges ChatGPT API users based on the total number of tokens exchanged in a request, i.e., the sum of the tokens in the request/prompt and in the model’s response.
🔸 The parameter “max_tokens” caps the number of tokens the model may generate in its response (the prompt length plus max_tokens must also fit within the model’s context window). It thus bounds both the cost per request and the length of the model’s response.
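Since billing depends on tokens in both directions, a per-request cost estimate can be sketched as below. The per-1K-token prices are placeholders for illustration, not current OpenAI rates:

```python
def estimate_cost(prompt_tokens, completion_tokens,
                  price_per_1k_prompt=0.0015, price_per_1k_completion=0.002):
    """Estimate request cost from token counts.

    The default prices are illustrative placeholders; check OpenAI's
    pricing page for the actual per-model rates.
    """
    return (prompt_tokens / 1000) * price_per_1k_prompt \
         + (completion_tokens / 1000) * price_per_1k_completion

cost = estimate_cost(prompt_tokens=1000, completion_tokens=1000)
```

Lowering max_tokens lowers the worst-case `completion_tokens`, and with it the worst-case cost.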
🔸 Temperature-based sampling algorithms use the softmax function to generate a probability distribution for a set of predicted tokens and the “temperature” parameter is used to adjust the sampling process by scaling the logits (log probabilities).
🔸 The temperature value scales the logits before applying softmax during text generation, affecting the probability distribution of the next word. A higher temperature value results in a flatter probability distribution, making it more likely for the model to select less probable tokens, resulting in more creative output.
Softmax(zᵢ) = exp(zᵢ) / Σⱼ exp(zⱼ)

With temperature scaling:

Softmax(zᵢ) = exp(zᵢ / Temperature) / Σⱼ exp(zⱼ / Temperature)
🔸 At a low temperature (e.g., 0.2), the model’s responses are likely to be conservative and similar across runs. As we increase the temperature to a high value (e.g., 0.9), the responses become more diverse but may also include more surprising or less predictable elements.
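The effect of dividing the logits by the temperature before the softmax can be demonstrated in a few lines (the logit values are made up for illustration):

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    # Scale logits by 1/temperature before the softmax:
    # T < 1 sharpens the distribution, T > 1 flattens it.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
cold = softmax_with_temperature(logits, temperature=0.2)
hot = softmax_with_temperature(logits, temperature=2.0)
# "cold" concentrates nearly all mass on the top token;
# "hot" spreads probability more evenly across all three.
```

At temperature 0.2 the top token dominates, so repeated runs pick it almost every time; at 2.0 the lower-ranked tokens get a realistic chance of being sampled.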
🔸 Typical use cases for different values of temperature: low values (around 0–0.3) suit factual Q&A, summarization, and code generation; moderate values (around 0.5–0.7) suit general conversation; high values (0.8 and above) suit creative writing and brainstorming.
🔸 The “top_p” parameter sets the threshold for nucleus sampling: the algorithm considers only the smallest set of most probable tokens whose cumulative probability reaches the threshold value.
🔸 The top-p sampling strategy truncates the cumulative probability distribution, allowing only a subset of tokens to be considered for selection based on their cumulative probability. The resulting probabilities of the subset are then re-normalized so that they sum up to 1.
🔸 Applying a top-p value of 80% modifies the original distribution by considering only the top tokens whose cumulative probability adds up to 80%. This filters out the least likely tokens while retaining a significant portion of the distribution, allowing for diversity in the generated text but with a focus on more probable tokens.
🔸 A top-p value of 50% results in a more concentrated selection of tokens, with only the most probable tokens being considered. This significantly reduces the diversity of the output but increases the likelihood that the generated text will be coherent and on-topic, as it relies on the most probable tokens.
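A small sketch of the truncate-and-renormalize step described above, using an invented five-token distribution:

```python
def top_p_filter(probs, top_p):
    """Keep the most probable tokens whose cumulative probability first
    reaches top_p, then renormalize the survivors to sum to 1.
    Returns a {token_index: probability} dict."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}

probs = [0.5, 0.3, 0.1, 0.06, 0.04]
nucleus = top_p_filter(probs, top_p=0.8)
# Tokens 0 and 1 (cumulative 0.8) survive; their probabilities are
# renormalized to 0.625 and 0.375.
```

With top_p=0.5 only token 0 would survive, collapsing the sampling to the single most probable token.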
🔸 Both the parameters Temperature and Top-p can be used to control the diversity of the model responses. Temperature does so by affecting all tokens equally across the distribution whereas, Top-p does so by dynamically adjusting the range of considered tokens based on the desired cumulative probability.
🔸 The Frequency penalty parameter penalizes a token or a sequence of tokens based on how often it has already been sampled, whereas the Presence penalty parameter penalizes a token or a sequence of tokens based on whether it has already been sampled at least once.
🔸 They penalize by directly modifying the logits (un-normalized log probabilities) with an additive contribution as shown below.
mu[j] -> mu[j] - c[j] * alpha_frequency - float(c[j] > 0) * alpha_presence
Where,
🔸 mu[j] is the logit of token j
🔸 c[j] is the number of times token j has already been sampled
🔸 alpha_frequency and alpha_presence are the frequency and presence penalty coefficients
🔸 float(c[j] > 0) evaluates to 1 if token j has appeared at least once, and 0 otherwise
🔸 Penalty values can range from -2.0 to 2.0, and a positive value decreases the likelihood of repetition. Note that a coefficient close to 2 can strongly suppress repetition but may also degrade the quality of the samples.
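The additive penalty formula translates directly into code. The logits and sample counts below are invented for illustration:

```python
def apply_penalties(logits, counts, alpha_frequency=0.0, alpha_presence=0.0):
    # mu[j] -> mu[j] - c[j]*alpha_frequency - (c[j] > 0)*alpha_presence
    return [
        mu - c * alpha_frequency - (1.0 if c > 0 else 0.0) * alpha_presence
        for mu, c in zip(logits, counts)
    ]

logits = [2.0, 1.5, 0.0]
counts = [3, 0, 1]  # token 0 sampled three times, token 2 once
adjusted = apply_penalties(logits, counts,
                           alpha_frequency=0.4, alpha_presence=0.6)
# Token 0 loses 3*0.4 + 0.6 = 1.8; token 1 is untouched;
# token 2 loses 0.4 + 0.6 = 1.0.
```

Note how the frequency term grows with every repetition while the presence term is a flat one-time cost, matching the two parameters’ descriptions above.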
🔸 The “logprobs” parameter specifies the number of most likely tokens and their respective log probabilities to be returned in the ChatGPT API’s response for every token sampled in the completion.
🔸 The log probability of a token is the natural logarithm of its probability; probabilities very close to zero become large negative values in logarithmic form.
🔸 Probability values in their raw form can be very small and difficult to work with, but their logarithmic values can be more manageable and computationally efficient.
🔸 Log probabilities can provide analytical insight into the capabilities of a language model based on which, an additional level of fine control and flexibility can be attained in generating optimized responses.
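Converting reported log probabilities back to plain probabilities is a single exponentiation. The tokens and values below are hypothetical stand-ins for what the API might report at one sampled position:

```python
import math

# Hypothetical top-2 logprobs for one sampled position, in the form the
# API reports them (natural log of each candidate token's probability).
top_logprobs = {"Paris": -0.01, "London": -4.8}

probs = {token: math.exp(lp) for token, lp in top_logprobs.items()}
# exp(-0.01) is roughly 0.99, so the model was ~99% confident in "Paris";
# exp(-4.8) is under 1%, so "London" was a distant runner-up.
```

Inspecting these values per token is a cheap way to detect low-confidence spans in a generated response.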
🔸 Logits are the unnormalized numerical values produced by the model’s last linear layer that represent the model’s confidence over each possible class in a classification task. The logits are often passed through an activation function (such as softmax) to produce a probability distribution over the classes.
🔸 The “logit_bias” parameter accepts a JSON object that maps token IDs to bias values (-100 to 100). The specified bias values are added to the logits generated by the model for the respective tokens, thus affecting their likelihood of getting sampled.
🔸 The parameter can be used to influence a model’s preference for generating or avoiding a particular set of tokens in its response. For example, a collection of tokens related to a desired topic can be assigned with positive bias values to encourage the language model to generate text related to the topic more frequently.
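A sketch of a request payload using logit_bias. The token IDs below are placeholders, not real IDs; actual IDs must come from the tokenizer for your chosen model (e.g. via OpenAI’s tiktoken library):

```python
# "logit_bias" maps token-ID strings to bias values in [-100, 100].
# The IDs "25542" and "34469" here are hypothetical placeholders.
payload = {
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Suggest a pet."}],
    "logit_bias": {
        "25542": 10,    # nudge the model toward this token
        "34469": -100,  # effectively ban this token
    },
}
```

A bias of -100 is strong enough to exclude a token outright, while moderate values like ±10 merely shift its odds.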
🔸 A list of function definitions (name, description, and JSON-schema parameters) that the model may choose to call.
🔸 The model does not execute functions itself: when it decides a function is relevant, it returns a JSON object containing the function’s name and appropriate values for its arguments, which our application then uses to make the actual call.
🔸 They can be used to customize the post-processing of the ChatGPT API’s responses to meet specific requirements, enhance the quality of the generated text, and ensure that the output is suitable for the intended application or audience.
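A sketch of a function definition and of parsing the model’s reply. The schema follows OpenAI’s function-calling convention, but the function name, fields, and the canned reply are all illustrative:

```python
import json

# One entry in the "functions" list: a name, a description, and a
# JSON-schema declaration of the arguments.
functions = [{
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

# The model's reply carries the chosen function and its arguments as a
# JSON string; this is a canned stand-in for a real API response.
function_call = {"name": "get_weather", "arguments": '{"city": "Paris"}'}
args = json.loads(function_call["arguments"])
# Our application would now invoke its own get_weather(**args).
```

The arguments arrive as a string of JSON, so a `json.loads` step is always needed before dispatching to real code.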
🔸 It provides flexibility in controlling the behavior of the ChatGPT API response, allowing us to choose between real-time streaming of the generated tokens or receiving the complete response as a single block of text based on our specific requirements and use case.
🔸 The models behind the ChatGPT API can take a considerable amount of time to generate a full response, so real-time streaming is beneficial for applications that require immediate feedback or continuous interaction during the generation process.
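A sketch of consuming a streamed response. With stream=True the API sends the completion as a sequence of chunks, each carrying a small “delta” of text; the chunks below are canned stand-ins for what the server would send:

```python
# Request payload with streaming enabled.
request = {
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Say hello."}],
    "stream": True,
}

# Simulated stream: each chunk carries a fragment of the reply in its
# "delta"; the final chunk's delta is empty, signalling the end.
chunks = [
    {"delta": {"content": "Hel"}},
    {"delta": {"content": "lo!"}},
    {"delta": {}},
]

text = ""
for chunk in chunks:
    text += chunk["delta"].get("content", "")
    # A UI would render "text" here, growing as each chunk arrives.
```

In a real integration the loop iterates over the library’s streaming response object instead of a list, but the accumulate-the-deltas pattern is the same.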
Understanding and effectively utilizing the parameters of the ChatGPT API can significantly enhance the quality and relevance of the generated text. By carefully adjusting parameters like temperature, top_p, and max_tokens, users can tailor the model’s output to meet specific needs, balancing creativity, diversity, and coherence. For finer-grained control, the frequency and presence penalties and logit bias can be used to influence the likelihood of specific tokens.