*Description*: AI inference is the process in which a trained model is loaded into memory and then makes predictions based on input data. For example, "The Llama-3.2-90B-Vision-Instruct-FP8-dynamic model performs inference to identify objects in an image."

*Use it*: yes

[.vale-ignore]
*Incorrect forms*:

*See also*:

[[inferencing]]
==== image:images/yes.png[yes] inferencing (noun)
*Description*: _Inferencing_ is the active process of running a trained AI model against input data to produce outputs such as predictions, classifications, or generated text.
*Description*: In Red{nbsp}Hat Process Automation Manager and Red{nbsp}Hat Decision Manager, the _inference engine_ is the part of the Red{nbsp}Hat Decision Manager engine that matches production facts and data to rules. It is often called the brain of a production rules system because it can scale to a large number of rules and facts. It makes inferences based on its existing knowledge and performs actions based on what it infers from that information.
*Description*: In Red{nbsp}Hat OpenShift AI, `InferenceService` is the custom resource definition (CRD) used to create `InferenceService` objects. When referring to the CRD name, use `InferenceService` in monospace.
*Description*: _Inference serving_ is the process of deploying and hosting a large language model on a serving platform so that it can receive and respond to inference requests.

*Use it*: yes

[.vale-ignore]
*Incorrect forms*:

*See also*:

[[infiniband]]
==== image:images/yes.png[yes] InfiniBand (noun)
*Description*: _InfiniBand_ is a switched-fabric networking standard used in high-performance computing. The term is both a service mark and a trademark of the InfiniBand Trade Association. The association's rules for using the mark are standard ones: append the (TM) symbol the first time the term is used, and preserve the capitalization (including the intercapped "B") from then on. In ASCII-only contexts, the "\(TM)" string is an acceptable alternative.