Each type conversion has its own specific operation. In your case, the VI sets the destination value to the average of the three color components of the source.
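To make the averaging concrete, here is a rough Python sketch of the same per-pixel operation. It is not the VI's implementation: the function names `average_rgb` and `cast_rgb_to_gray` are made up for illustration, and truncating integer division is an assumption, since the VI may round differently.

```python
def average_rgb(pixel):
    """Return the mean of the three color components of one (R, G, B) pixel."""
    r, g, b = pixel
    # Assumption: truncating integer division; the VI's rounding may differ.
    return (r + g + b) // 3


def cast_rgb_to_gray(image):
    """Apply the per-pixel average to a 2D list of (R, G, B) tuples."""
    return [[average_rgb(p) for p in row] for row in image]


# Hypothetical 2x2 color image used only for illustration.
rgb_image = [
    [(255, 0, 0), (0, 255, 0)],
    [(30, 60, 90), (255, 255, 255)],
]
gray_image = cast_rgb_to_gray(rgb_image)
print(gray_image)  # [[85, 85], [60, 255]]
```

Note that pure red and pure green both map to 85 here, because a plain average weights all three channels equally.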
For more:
http://zone.ni.com/reference/en-XX/help/370281P-01/imaqvision/casting_images/