openvino_genai.ContinuousBatchingPipeline
- class openvino_genai.ContinuousBatchingPipeline
Bases: pybind11_object
This class is used for generation with LLMs with continuous batching.
- __init__(*args, **kwargs)
Overloaded function.
__init__(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, models_path: str, scheduler_config: openvino_genai.py_openvino_genai.SchedulerConfig, device: str, properties: dict[str, object] = {}, tokenizer_properties: dict[str, object] = {}) -> None
__init__(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, models_path: str, tokenizer: openvino_genai.py_openvino_genai.Tokenizer, scheduler_config: openvino_genai.py_openvino_genai.SchedulerConfig, device: str, properties: dict[str, object] = {}) -> None
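The first overload can be sketched as follows, assuming a model directory that has already been exported to OpenVINO format. `build_pipeline` and its parameters are hypothetical names used for illustration, and the `cache_size` attribute on `SchedulerConfig` is not documented on this page, so treat it as an assumption to check against the `SchedulerConfig` reference.

```python
def build_pipeline(models_path, device="CPU", cache_size_gb=1):
    """Create a pipeline with the (models_path, scheduler_config, device) overload."""
    # Imported lazily so this helper can be defined without the package installed.
    import openvino_genai

    scheduler_config = openvino_genai.SchedulerConfig()
    # Assumption: SchedulerConfig exposes `cache_size` (KV cache size, in GB).
    scheduler_config.cache_size = cache_size_gb

    # The signature declares `models_path: str`, so pathlib.Path inputs are converted.
    return openvino_genai.ContinuousBatchingPipeline(
        str(models_path), scheduler_config, device
    )
```

The second overload takes an already constructed `Tokenizer` as the second positional argument instead of `tokenizer_properties`.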
Methods
__delattr__(name, /): Implement delattr(self, name).
__dir__(): Default dir() implementation.
__eq__(value, /): Return self==value.
__format__(format_spec, /): Default object formatter.
__ge__(value, /): Return self>=value.
__getattribute__(name, /): Return getattr(self, name).
__gt__(value, /): Return self>value.
__hash__(): Return hash(self).
__init__(*args, **kwargs): Overloaded function.
__init_subclass__(): This method is called when a class is subclassed.
__le__(value, /): Return self<=value.
__lt__(value, /): Return self<value.
__ne__(value, /): Return self!=value.
__new__(**kwargs)
__reduce__(): Helper for pickle.
__reduce_ex__(protocol, /): Helper for pickle.
__repr__(): Return repr(self).
__setattr__(name, value, /): Implement setattr(self, name, value).
__sizeof__(): Size of object in memory, in bytes.
__str__(): Return str(self).
__subclasshook__(): Abstract classes can override this to customize issubclass().
add_request(*args, **kwargs): Overloaded function.
generate(*args, **kwargs): Overloaded function.
get_config(self)
get_metrics(self)
get_tokenizer(self)
has_non_finished_requests(self)
step(self)
- __class__
alias of pybind11_type
- __delattr__(name, /)
Implement delattr(self, name).
- __dir__()
Default dir() implementation.
- __eq__(value, /)
Return self==value.
- __format__(format_spec, /)
Default object formatter.
- __ge__(value, /)
Return self>=value.
- __getattribute__(name, /)
Return getattr(self, name).
- __gt__(value, /)
Return self>value.
- __hash__()
Return hash(self).
- __init_subclass__()
This method is called when a class is subclassed.
The default implementation does nothing. It may be overridden to extend subclasses.
- __le__(value, /)
Return self<=value.
- __lt__(value, /)
Return self<value.
- __ne__(value, /)
Return self!=value.
- __new__(**kwargs)
- __reduce__()
Helper for pickle.
- __reduce_ex__(protocol, /)
Helper for pickle.
- __repr__()
Return repr(self).
- __setattr__(name, value, /)
Implement setattr(self, name, value).
- __sizeof__()
Size of object in memory, in bytes.
- __str__()
Return str(self).
- __subclasshook__()
Abstract classes can override this to customize issubclass().
This is invoked early on by abc.ABCMeta.__subclasscheck__(). It should return True, False or NotImplemented. If it returns NotImplemented, the normal algorithm is used. Otherwise, it overrides the normal algorithm (and the outcome is cached).
- add_request(*args, **kwargs)
Overloaded function.
add_request(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, request_id: int, input_ids: openvino._pyopenvino.Tensor, sampling_params: openvino_genai.py_openvino_genai.GenerationConfig) -> openvino_genai.py_openvino_genai.GenerationHandle
add_request(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, request_id: int, prompt: str, sampling_params: openvino_genai.py_openvino_genai.GenerationConfig) -> openvino_genai.py_openvino_genai.GenerationHandle
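As a rough illustration of the string overload, the sketch below enqueues several prompts and keeps the returned `GenerationHandle` objects keyed by request id. `submit_prompts` is a hypothetical helper, and deriving ids from `enumerate` is only an assumption about how a caller might keep them distinct; this page does not say how ids must be chosen.

```python
def submit_prompts(pipeline, prompts, generation_config, first_request_id=0):
    """Queue each prompt with add_request and return {request_id: handle}."""
    handles = {}
    for request_id, prompt in enumerate(prompts, start=first_request_id):
        # Matches add_request(request_id: int, prompt: str, sampling_params).
        handles[request_id] = pipeline.add_request(
            request_id, prompt, generation_config
        )
    return handles
```

Queued requests appear to make progress only when the pipeline is advanced with `step()`, so a helper like this would normally be paired with a loop that drives the pipeline.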
- generate(*args, **kwargs)
Overloaded function.
generate(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, input_ids: list[openvino._pyopenvino.Tensor], sampling_params: list[openvino_genai.py_openvino_genai.GenerationConfig], streamer: Union[Callable[[str], bool], openvino_genai.py_openvino_genai.StreamerBase, None] = None) -> list[openvino_genai.py_openvino_genai.EncodedGenerationResult]
generate(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline, prompts: list[str], sampling_params: list[openvino_genai.py_openvino_genai.GenerationConfig], streamer: Union[Callable[[str], bool], openvino_genai.py_openvino_genai.StreamerBase, None] = None) -> list[openvino_genai.py_openvino_genai.GenerationResult]
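Both overloads take `sampling_params` as a list rather than a single `GenerationConfig`. The sketch below assumes one config per prompt is expected, which the signature suggests but this page does not state; `generate_batch` is a hypothetical helper name.

```python
def generate_batch(pipeline, prompts, generation_config):
    """Run generate() on a list of prompts, reusing one config for all of them."""
    # Assumption: the config list must have the same length as the prompt list.
    sampling_params = [generation_config] * len(prompts)
    # The optional `streamer` argument is left at its default of None.
    return pipeline.generate(prompts, sampling_params)
```

With string prompts the call returns a list of `GenerationResult` objects; passing a list of token-id tensors instead selects the overload that returns `EncodedGenerationResult` objects.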
- get_config(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline) -> openvino_genai.py_openvino_genai.GenerationConfig
- get_metrics(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline) -> openvino_genai.py_openvino_genai.PipelineMetrics
- get_tokenizer(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline) -> openvino_genai.py_openvino_genai.Tokenizer
- has_non_finished_requests(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline) -> bool
- step(self: openvino_genai.py_openvino_genai.ContinuousBatchingPipeline) -> None
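`has_non_finished_requests()` and `step()` are presumably meant to be used together to drive requests queued with `add_request`. A minimal sketch of such a loop, using only those two methods, is shown below; `run_until_idle` and its `max_steps` guard are hypothetical and not part of the library.

```python
def run_until_idle(pipeline, max_steps=None):
    """Call step() until no unfinished requests remain; return the step count."""
    steps = 0
    while pipeline.has_non_finished_requests():
        # Optional safety limit so a caller can bound the loop.
        if max_steps is not None and steps >= max_steps:
            break
        pipeline.step()
        steps += 1
    return steps
```

Because `step()` returns `None`, any generated output has to be collected through the `GenerationHandle` objects returned by `add_request`.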