o ûÇh>> # xdoctest: +SKIP >>> sum_reducer = _CustomReducer( >>> torch.tensor(0.0), >>> lambda a, b: a + b >>> ) cCs||_||_dS©N)Ú init_valueÚ reduce_fn)Úselfr r©rú{/var/www/html/construction_image-detection-poc/venv/lib/python3.10/site-packages/torch/distributed/pipelining/microbatch.pyÚ__init__(s z_CustomReducer.__init__N)Ú__name__Ú __module__Ú__qualname__Ú__doc__rrrrrrsrc@óeZdZdS)Ú_LossReducerN©rrrrrrrr-órgcCs||Srr)ÚaÚbrrrÚ1órc@sfeZdZUdZdd„Zeed<dd„Zdd„Ze d e ed ffdd„ƒZe d ee effd d„ƒZdS)rz2 Class used to specify chunking of inputs cCs ||_dSr©Ú split_dim)rr rrrr=s zTensorChunkSpec.__init__r cCs |jj›d|jj›d|j›dS)NÚ.ú(ú))Ú __class__rrr ©rrrrÚ__repr__BsÿzTensorChunkSpec.__repr__cCsd|j›dS)NzTensorChunkSpec(r#rr%rrrÚ__str__GszTensorChunkSpec.__str__Ú chunk_dims.cCót|dd„ƒ}|S)aŠ A helper for creating a tuple of `TensorChunkSpec` from a tuple of chunk dimensions (int's). Example: >>> # xdoctest: +SKIP >>> # There are three positional arguments to the model, and >>> # we are chunking them along dimension 0, 0 and 1, respectively >>> args_chunk_spec = TensorChunkSpec.from_tuple((0, 0, 1)) cSót|ƒSr©r©ÚdimrrrrYrz,TensorChunkSpec.from_tuple..r)r(Úargs_chunk_specrrrÚ from_tupleJs þzTensorChunkSpec.from_tuplecCr))a\ A helper for creating a dictionary of `TensorChunkSpec` from a dictionary of chunk dimensions (int's). Example: >>> # xdoctest: +SKIP >>> # Chunk dimension 0 for the "id" argument, 1 for the "mask" argument >>> kwargs_chunk_spec = TensorChunkSpec.from_dict({"id": 0, "mask": 1}) cSr*rr+r,rrrrkrz+TensorChunkSpec.from_dict..r)r(Úkwargs_chunk_specrrrÚ from_dict]s þzTensorChunkSpec.from_dictN)rrrrrÚintÚ__annotations__r&r'ÚstaticmethodÚtupler/ÚdictÚstrr1rrrrr8s ÿ ÿrc@r)Ú _ReplicateNrrrrrr8qrr8c!sÖi}g}|}d}t|ƒt|ƒks"Jdt| ¡ƒ›dt| ¡ƒ›ƒ‚| ¡D]ê\}}t|ƒ\} } | | ¡||}|dus?J‚t|ƒ\}} t| ƒt|ƒkrWtd|›d|›ƒ‚g}t| |ƒD]\}}|tuslt |t jƒsu| |g|¡q^t |tƒrt |t jƒsˆJ|›dƒ‚| |j¡}||kr´|r¦t d|›d |›d |›d¡|}ntd|›d |›d|›dƒ‚t |||j¡}trýg}d}|D]2}t |¡}|| |j¡}tdddƒg|j}t||ƒ||j<|||<| |¡|| |j¡7}qÄ| |¡n| |¡d}q^td|›ƒ‚|||<q&g}t|ƒD]!‰i}| ¡D]\}}‡fdd„|Dƒ}|||<q| |¡qg}|D]+}i}t|ƒt|ƒksLJ‚t| ¡|ƒD]\\}}} t|| ƒ||<qS| |¡q=|S)aW Given a dictionary of args, and a dictionary of chunking specs, shard the args according to the chunking specs. Args: args_dict: Dictionary of args args_chunk_spec: Dictionary of chunking specs num_chunks: Number of chunks to shard the args into Returns: args_split: List of sharded args Tzargs_dict.keys() = z args_chunk_spec.keys() = NzArgument value z9 did not have the same number of values as as chunk spec z is not a tensorz%Tensor size on chunking dimension is z', downsizing the number of chunks from z to r!zArg z% on chunking dimension has a size of z$, smaller than the number of chunks zŒ. PiPPy cannot reduce the number of chunks because other arguments have bigger chunk-dimension sizes. Please adjust your num_chunks setting.rFzUnrecognized chunk spec: csg|]}|ˆ‘qSrr)Ú.0Úv_flat©Ú chunk_idxrrÚ ãsz'_shard_dict_of_args..)ÚlenÚlistÚkeysÚitemsrÚappendÚ ValueErrorÚzipr8Ú isinstanceÚtorchÚTensorrÚsizer ÚloggerÚwarningÚRuntimeErrorÚtensor_splitÚ_debug_mask_minibatchesÚ zeros_likeÚsliceÚndimÚ TypeErrorÚranger)!Ú args_dictr.Ú num_chunksÚargs_sharded_replicatedÚ arg_specsÚreal_num_chunksÚfirst_tensorÚarg_keyÚargÚflatÚspecÚ chunk_specÚchunk_spec_flatÚ_Úsharded_arg_flatÚvÚchunk_vÚv_split_dim_sizeÚ chunk_tensorsÚexpanded_chunksÚ split_dim_idxÚchunk_tensorÚnew_valÚ upper_idxÚ slice_indicesÚchunks_flatÚ chunk_argsÚkeyÚarg_single_chunkÚ args_splitÚchunkÚper_chunk_argsÚarg_specrr;rÚ_shard_dict_of_argsusšÿ ÿÿÿÿÿÿÿÿ ÿ rsÚargs.ÚkwargsÚchunksr.r0Úreturnc Csà|duri}|durttƒft|ƒ}|durt |ttƒ¡}ttt|ƒƒtt|ƒƒ|ƒ}t|ƒ}t|||ƒ}t|ƒ|krOt|ƒ}ttt|ƒƒtt|ƒƒ|ƒ}t|ƒt|ƒkretdt|ƒ›dt|ƒ›ƒ‚dd„|Dƒ}||fS)a Given a sequence of args and kwargs, split them into a number of chunks according to their respective chunking specs. Args: args: Tuple of args kwargs: Dict of kwargs chunks: Number of chunks to split the args and kwargs into args_chunk_spec: chunking specs for args, in same shape as args kwargs_chunk_spec: chunking specs for kwargs, in same shape as kwargs Returns: args_split: List of sharded args kwargs_split: List of sharded kwargs Nz;args and kwargs are split into different number of chunks: z, cs*g|]‰t‡fdd„ttˆƒƒDƒƒ‘qS)c3s|]}ˆ|VqdSrr)r9Úi©rlrrÚ Vs€z;split_args_kwargs_into_chunks...)r5rRr>)r9rryrr=Usÿÿz1split_args_kwargs_into_chunks..)rÚDEFAULT_CHUNK_DIMr>r6ÚfromkeysrsÚ enumeraterK) rtrurvr.r0Úargs_split_dictrWÚkwargs_splitrorrrr ôsH8 ýý ýÿÿÿþr cs0|durt|ƒ\}}nt|dƒ\}}ttƒgt|ƒ}g‰|D]}t|ƒ\}}t|ƒt|ƒkr:td|›d|›ƒ‚ˆ |¡q g}t|ƒD]Ì\‰} t| tƒrÑ‡‡fdd„ttˆƒƒDƒ} t rÃ| dj }| dd…D] }|j |kssJ‚qjtjtj |dd iŽt| ƒ| jd } g}d}t| ƒt| ƒks“J‚t| | ƒD])\}}|| | j¡}tdddƒg|j}t||ƒ|| j<||}| |¡|}q˜n| }| tj|| jd¡qFt| tƒrò| j}ttˆƒƒD]}| |ˆ|ˆ¡}qß| |¡qFˆdˆ}tdtˆƒƒD] }ˆ|ˆ|ksJ‚qÿ| |¡qFt||ƒS)zæ Given a list of chunks, merge them into a single value according to the chunk spec. Args: chunks: list of chunks chunk_spec: Chunking spec for the chunks Returns: value: Merged value NrzChunk z did not match chunk spec csg|]}ˆ|ˆ‘qSrr)r9r<©Úarg_idxÚchunks_flattenedrrr=£s ÿÿz merge_chunks..éÚdeviceÚmeta)Úsectionsr-r,)rrr{r>rCrBr}rErRrMÚshaperFrLÚemptyr rDrHrOrPÚcatrr rr)rvr]Úspec_flattenedÚflatten_specÚchunk0_flatrpÚchunk_flattenedr_Úargs_flattenedrZÚpartial_valuesÚ overall_shapeÚvalÚmeta_chunksÚ values_to_catÚchunk_start_idxÚ partial_valueÚ meta_chunkÚ chunk_end_idxrjÚslicedÚreduced_valr<Úvaluerr€rr ]sd- þ ý ø ÿ r )NN)ÚloggingÚtypingrrrFÚ torch.fx.noderÚtorch.utils._pytreerrÚ__all__Ú getLoggerrrIrMrrÚtensorÚsum_reducerr{rr8rsr5r6r7r2r?r r rrrrÚsF 9û ÿþýüû úiÿ