Skip to main content
1 vote
2 answers
635 views

I have built a pipeline that read some data, does some manipulations and create some apache beam Row objects (Steps 1 and 2 in the code below). I then would like to generate statistic and write them ...
0 votes
1 answer
284 views

I'm using Python 3.12, and I'm trying to install the latest version (1.15.1) of TensorFlow Data Validation. However, while installing, I encountered the following error: ERROR: Could not find a ...
3 votes
4 answers
4k views

["referring to the data shown in the image while using the filter function and getting the return values, How can I return blank cells as it is instead of printing as Zeros while using filter ...
0 votes
2 answers
130 views

I am stucked in getting this block of statement.In my serving sets there are some anomalies. I am not getting what this below code is doing in removing the anomalies. payer_code = tfdv.get_feature(...
0 votes
1 answer
100 views

By the following vba excel code which I found on the internet, I can select more than one option within a cell from the dv dropdown list at the same time. What I am trying to do is to add a checkmark ...
0 votes
1 answer
259 views

I want to create a new TensorFlow Data Validation schema from scratch with fixed features name, type and presence. import tensorflow_data_validation as tfdv from tensorflow_metadata.proto.v0 import ...
1 vote
1 answer
421 views

I am presently trying to feed a dataset with a few multivalent feature columns through a TensorFlow Extended (TFX) pipeline. Here is a row from my sample data: user_id 29601 ...
1 vote
1 answer
453 views

I'm using TFX (more precisely TensorFlow Data Validation) with the infer_schema method documented there https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/infer_schema. It generates a ...
-1 votes
1 answer
437 views

When I import tensorflow data validation using import tensorflow_data_validation as tfdv I get ModuleNotFoundError: No module named 'tensorflow_data_validation' When I want to install it using !pip ...
0 votes
1 answer
412 views

I'm trying to install tensorflow_data_validation but it shows the below error. it says that i need Python version 2.7 or 3.4+ but I'm using the version 3.10.9. Also I have the 2.11.0 version of ...
4 votes
1 answer
615 views

I want to infer a schema with tensorflow data validation (tfdv) based on a pandas dataframe of the training data. The dataframe contains a column with a multivalent feature, where multiple values (or ...
0 votes
1 answer
117 views

a superimposed display for train/val splits using StatisticsGen Hi, I'm currently using tfx pipeline inside kubeflow. I struggle to have StatisticsGen showing a single graph with train and validation ...
0 votes
4 answers
888 views

It is puzzling to me that there is a tfdv.load_statistics() function, but no corresponding tfdv.write_statistics() function. How do I go about saving the statistics, and then loading them again? e.g. ...
5 votes
1 answer
816 views

Question: Is it possible to run tensorflow-data-validation on MacOS with M1 chip? Steps taken: I have created a conda environment (tfdv38) in which I have installed the Mac-optimized TensorFlow. I ...
0 votes
1 answer
361 views

I'm using Tensorflow Data Validation to generate stats from the data and infer an schema to input in TFX. I didn't find any option to specify the NaN values, e. g., in pandas there is a field "...

15 30 50 per page