
get_steps_per_epoch fails if block_shape is None #278

Closed
hvgazula opened this issue Mar 4, 2024 · 4 comments · Fixed by #295

hvgazula commented Mar 4, 2024

The `get_steps_per_epoch` method of the `nobrainer.dataset.Dataset` class assumes blocks are created from the input images. The method needs refactoring to handle the `block_shape = None` case.

```python
import math

import numpy as np


def get_steps_per_epoch(self):
    def get_n(a, k):
        # Number of non-overlapping blocks of size k along an axis of size a.
        return (a - k) / k + 1

    # Fails here when self.block_shape is None: zip() cannot iterate it.
    n_blocks = tuple(
        get_n(aa, kk) for aa, kk in zip(self.volume_shape, self.block_shape)
    )
    for n in n_blocks:
        if not n.is_integer() or n < 1:
            raise ValueError(
                "cannot create non-overlapping blocks with the given parameters."
            )
    n_blocks_per_volume = np.prod(n_blocks).astype(int)
    steps = n_blocks_per_volume * self.n_volumes / self.batch_size
    steps = math.ceil(steps)
    return steps
```
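One way to handle the missing case (a sketch only; the standalone function and its fallback behavior are my assumptions, not the fix that was merged in #295) is to fall back to the full volume shape when `block_shape` is `None`, so each volume contributes exactly one block:

```python
import math


def steps_per_epoch(volume_shape, block_shape, n_volumes, batch_size):
    """Standalone sketch; argument names mirror the Dataset attributes."""
    # With block_shape=None, treat the whole volume as a single block.
    if block_shape is None:
        block_shape = volume_shape

    def get_n(a, k):
        # Number of non-overlapping blocks of size k along an axis of size a.
        return (a - k) / k + 1

    n_blocks = tuple(get_n(a, k) for a, k in zip(volume_shape, block_shape))
    for n in n_blocks:
        if not n.is_integer() or n < 1:
            raise ValueError(
                "cannot create non-overlapping blocks with the given parameters."
            )
    n_blocks_per_volume = math.prod(int(n) for n in n_blocks)
    return math.ceil(n_blocks_per_volume * n_volumes / batch_size)
```

For example, 256³ volumes cut into 128³ blocks yield 8 blocks per volume, while `block_shape=None` yields one step per `batch_size` volumes.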

@hvgazula hvgazula added the bug label Mar 4, 2024
@hvgazula hvgazula self-assigned this Mar 4, 2024
hvgazula commented Mar 5, 2024

It won't fail after all; `block_shape` is never `None` because the property derives it from the dataset's element spec:

```python
@property
def block_shape(self):
    return tuple(self.dataset.element_spec[0].shape[1:4].as_list())
```

@hvgazula hvgazula closed this as completed Mar 5, 2024
@hvgazula hvgazula reopened this Mar 6, 2024
hvgazula commented Mar 6, 2024

It fails if `n_volumes` is not set, which is the case when the `from_tfrecords` function is called.

hvgazula commented Mar 6, 2024

This line:

```python
block_length = len([0 for _ in first_shard])
```

is a huge bottleneck in the code. For context, please refer to https://stackoverflow.com/questions/70992022/how-to-get-the-correct-cardinality-of-a-tensorflow-dataset-after-filtering
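The expression counts elements by iterating the entire shard, so merely learning the length forces a full pass that materializes every record. A minimal illustration of the pattern (a plain-Python stand-in, not the TensorFlow code):

```python
def count_by_iteration(iterable):
    # Equivalent to len([0 for _ in iterable]): every element must be
    # produced (and, for a TFRecord dataset, parsed) just to be counted.
    return sum(1 for _ in iterable)
```

For a `tf.data` pipeline this is often the only option because `Dataset.cardinality()` reports `UNKNOWN_CARDINALITY` after operations such as `filter`, which is the situation discussed in the linked Stack Overflow question.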

@hvgazula

This is resolved by calculating the number of files within each shard. It doesn't account for the last shard possibly having fewer files, but that shouldn't impact the training process in any way.
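A sketch of that resolution (function and argument names are illustrative, under the stated assumption that every shard holds the same number of examples):

```python
import math


def n_volumes_from_shards(n_shards, examples_per_shard):
    # Assumes full shards; if the last shard is short, this slightly
    # overestimates n_volumes, which only pads steps_per_epoch upward.
    return n_shards * examples_per_shard


def steps_from_shards(n_shards, examples_per_shard, blocks_per_volume, batch_size):
    n_volumes = n_volumes_from_shards(n_shards, examples_per_shard)
    return math.ceil(n_volumes * blocks_per_volume / batch_size)
```

Deriving the count from file metadata this way avoids the full pass over the records entirely.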

hvgazula added a commit that referenced this issue Mar 12, 2024
@hvgazula hvgazula linked a pull request Mar 20, 2024 that will close this issue